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SMAD6 AND USES THEREOF 



Field of the InvenHon 

This invention relates to nucleic acids and encoded polypeptides which interact with the 
TGF-P superfamily receptor complexes and which are a negative regulators of TGF-p 
supcrfamily signalling. The invention also relates to agents which bind the nucleic acids or 
polypeptides. The invention further relates to methods of using such nucleic acids and 
polypeptides in the treatment and/or diagnosis of disease. 



Background of the Inventioq 

During mammalian embryogenesis and aduh tissue homeostasis Iransfomiing growth 
factor P (TGF-P) performs pivotal tasks in interccllulai- communication (Roberts et ah, Growth 
Factors 8:1-9, 1993). The cellular effects of this pleiotropic factor are exerted by ligand-induced 
hetero-oligomerization of two distantly related type I and type II serine/threonine kinase 
receptors, TpR-I and TpRTI, respectively (Lin and Lodish, Trends Cell Biol. 1 1 :972-978, 1993; 
Derynck, Trends Biochem. 5c/.19-:548-553, 1994; Massaguc and Weis-Garcia, Cancer Surv. 
27:41-64, 1996; ten Dijke ei al., Curr Opin, Cell. Biol 8:139-145, 1996). The two receptors, 
which both are required for signalling, act in sequence; TpR-I is a substrate for the constitutively 
active TpR-II kinase (Wrana et at., Nature 370:341-347, 1994; Weisere/ al, EMBO J. 14:2199- 
2208, 1995). 

TGF-p is the prototype of a large family of structurally related proteins that are 
involved in various biological activities (Massague, et al., Trends Cell BioL 7:187-192, 1997; 
Roberts & Spom, in: Peptide growth factors and their receptors. Part I (Sporn, M.B. and 
Roberts, A.B., eds) pp. 319-472, Springer- Verlag, Heidelberg (1990); Yingling et ak, 
Biochim. Biophys. Acta 1242:115-136, 1995). The TGF-p "superfamily" includes activins and 
bone morphogenetic proteins (BMPs) that signal in a similar fashion, each employing distinct 
complexes of type I and type II serine/threonine kinase receptors (Lin and Lodish, 1 993; 
Derynck, 1994; Massague and Weis-Garcia, 1996; ten Dijke etaL, 1996). TGF-p related 
molecules act in environments where multiple signals interact and are likely to be under tight 
spatial and chronological regulation. For example, activin and BMP exen antagonistic effects 
in the development of Xenopus embryos (Graff et al., Cell 85:479-487, 1996). Chordin 
(Piccolo et al., Cell 86:589-598, 1996) and noggin (Zimmerman et al., Cell 86:599-606. 
1996), for example, inhibit the ventralizing effect of BMP4 by binding specifically to the 



wo 98/56913 PCT/US98/12078 

-2- 

ligand. Likewise, follistatin neutralizes the activity of activin (Hemmati-Brivalou et al.. Cell 
77:283-295, 1994). 

Genetic studies of TGF-p-like signalling pathways in Drosophila and Caenorhabdiiis 
elegans have led to the identification of mothers against dpp (Mad) (Sekclsky et ai, Genetics 
5 139:1347-1358, mS) and sma (Savage et ai, Proc. Nat. Acad Sci. USA 93:790-794, 1996) 
genes, respectively. The products of these related genes perform essential functions downstream 
of TGF-P-like ligands acting via serine/threonine kinase receptors in these organisms 
(Wiersdorff cr ai, Development 122:2153:2163, 1996; Newfeld et aL, Development 122:2099- 
2 1 08, 1 996; Hoodless et aL, Cell 85:489-500, 1 996). Vertebrate homologs of Mad and sma have 
10 been termed Smads (Derynck et al. Cell 87:173, 1996) or MADR genes (Wrana and Attisano, 
Trends Genet. 12:493-496,1996). Genetic alterations in and .^/ntfr7^/Z)PC^ have been ' 

found in specific tumor subsets, and thus Smads may function as tumor suppressor genes (Hahn 
et al., Science 271 :350-353, 1996; Riggins et al, Nature Genet. 13:347-349,1996; Eppert et aL, 
Cell 86:543-552, 1996). Smad proteins share two regions of high similarity, termed MHl and 

15 MH2 domains, connected with a variable proline-rich sequence (Massague, Cell 85:947-950, 
1996; Derynck and Zhang, Curr Biol 6:1226-1229, 1996). The C-terminal part of Smad2, when 
fused to a heterologous DNA-binding domain, was found to have transcriptional activity (Liu et 
al, Nature 381:620-623, 1996; Meersseman a/., Mech. Dev. 61:127-1400, 1997). The intact 
Smad2 protein when fused to a DNA-binding domain, was latent, but transcriptional activity was 

20 unmasked after stimulation with ligand (Liu et al., 1996). 

Different Smads specify different responses using functional assays mXenopus, Whereas 
Smad I induces ventral mesoderm, a BMP-like response, Smad2 induces dorsal mesoderm, an 
activin/TGF-p-like response (Graff et al. Cell 85:479-487, 1 996; Baker and Harland, Genes dJ 
Dev. 10:1880-1889, 1996; Thomsen, Deve/o/?/we«r 122:2359-2366, 1996). Upon ligand 

25 stimulation Smads become phosphorylated on serine and threonine residues; BMP stimulates 
Smadl phosphorylation, whereas TGF-p induces Smad2 and Smad3 phosphorylation (Hoodless 
etal, Ce// 85:489-500, 1996; L'metaL, 1996; Eppert a/., 1996; Lechleider (?/ a/,, / Biol 
C/iem. 271:17617-17620, \9%',Y\vigVm^ei al, Proc. NuH Aced Sci. t/i^/l 93: 8940-8944, 1996; 
Zhang etal. Nature 383:168-172, 1996; Macias-Silva et al. Cell 87:1215-1224, 1996; Nakao et 

30 a/., Bwl Chem. 272:2896-2900, 1996). 

Smad4 is a common component of TGF-p, activin and BMP signalling (Lagna et al. 
Nature 383:832-836, 1996; Zhang et al. Curr. Biol 7:270-276, 1997; de Winter et al. 
Oncogene 14:1891-1900, 1997). Smad4 phosphorylation has thus far been reported only after 
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activin stimulation of transfected cells (Lagna et al, 1996). After stimulation with TGF-P or 
activin Smad4 interacts with Smad2 or Smad3, and upon BMP challenge a heteromeric complcK 
of Smad4 and Smadl has been observed (Lagna e/ al, 1996). Upon ligand stimulation, Smad 
complexes translocate from the cytoplasm to the nucleus (Hoodless et aL, 1996; Liu el aL, 1996; 
Baker and Hariand, 1996; Macias-Silva et al, 1996), where they, in combination with DNA- 
binding proteins, may regulate gene transcription (Chen et ai. Nature 383:691-696, 1996). 

Summary of the Invention 

The invention provides isolated nucleic acid molecules, unique fragments of those 
molecules, expression vectors containing the foregoing, and host cells transfected with those 
molecules. The invention also provides isolated polypeptides and agents which bind such 
polypeptides, including antibodies. The foregoing can be used in the diagnosis or treatment of 
conditions characterized by the expression of a Smad6 nucleic acid or polypeptide, or lack 
thereof. The invention also provides methods for identifying pharmacological agents useful in 
the diagnosis or treatment of such conditions. Here, we present the identification of Smad6, 
which inhibits phosphorylation of pathway specific Smad polypeptides including Smad2 and 
Smadl and inhibits the TGF-P superfamily signalling pathway such as the TGF-P and BMP 
signalling pathways. 

According to one aspect of the invention, an isolated nucleic acid molecule is provided. 
The molecule hybridizes under stringent conditions to a molecule consisting of the nucleic acid 
sequence of SEQ ID NO: 1 , The isolated nucleic acid molecule codes for a polypeptide which 
inhibits TGF-P, activin, or BMP signaUing. The invention further embraces nucleic acid 
molecules that differ from the foregoing isolated nucleic acid molecules in codon sequence due 
to the degeneracy of the genetic code. The invention also embraces complements of the 
foregoing nucleic acids. 

In certain embodiments, the isolated nucleic acid molecule comprises a molecule 
consisting of the nucleic acid sequence of SEQ ID N0:3 or consists essentially of the nucleic 
acid sequence of SEQ ID N0:1. Preferably, the isolated nucleic acid molecule consists of the 
nucleic acid sequence of SEQ ID N0;3, 

According to another aspect of the invention, an isolated nucleic acid molecule is 
provided. The isolated nucleic acid molecule comprises a molecule consisting of a unique 
fragment of SEQ ID N0:3 between 12 and 1487 nucleotides in length and complements thereof, 
provided that the isolated nucleic acid molecule excludes sequences consisting only o^'SEQ ID 
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N0:4. In one embodiment, the isolated nucleic acid molecule consists of between 12 and 32 
contiguous nucleotides of SEQ ID NO: 1 , or complements of such nucleic acid molecules. In 
preferred embodiments, the unique fragment is at least 14, 15, 16, 17, 18, 20 or 22 contiguous 
nucleotides of the nucleic acid sequence of SEQ ID N0:1, SEQ ID N0:3, or complements 
thereof 

According to another aspect of the invention, the invention involves expression vectors, 
and host cells transformed or transfected with such expression vectors, comprising the nucleic 
acid molecules described above. 

According to still other aspects of the invention, transgenic non-human animals ai e 
provided. The animals include in certain embodiments the foregoing expression vectors. In 
certain preferred embodiments, the transgenic non-human animal includes a conditional Smad6 
expression vector, such as an expression vector that increases expression of Smad6 in a tissue 
specific, development stage specific, or inducible manner. In other embodiments, the transgenic 
non-human animal has reduced expression of Smad6 nucleic acid molecules. In some 
embodiments, the transgenic non-human animal includes a Smad6 gene disrupted by 
homologous recombination. The disruption can be homozygous or heterozygous. In other 
embodiments, the transgenic non-human animal includes a conditional Smad6 gene disruption, 
such as one mediated by e.g. tissue specific, development stage specific, or inducible, expression 
of a recombinase. In yet other embodiments, the transgenic non-human animal includes a trans- 
acting negative regulator of Smad6 expression, such as antisense Smad6 nucleic acid molecules, 
nucleic acid molecules which encode dominant negative Smad6 proteins, Smad6 directed 
ribozymes, etc. 

According to another aspect of the invention, an isolated polypeptide is provided. The 
isolated polypeptide is encoded by the isolated nucleic acid molecule of any of claims 1, 2, 3, or 
4, and the polypeptide has TGF-P, activin, or BMP signalling inhibitory activity. Preferably, the 
isolated polypeptide consists of the amino acid sequence of SEQ ID N0:2. 

In other embodiments, the isolated polypeptide consists of a fragment or variant of the 
foregoing which retains the activity of the foregoing. 

According to another aspect of the invention, there are provided isolated polypeptides 
which selectively bind a Smad6 protein or fragment thereof, provided thai the isolated 
polypeptide is not a TGF-P supcrfamily receptor, such as a TGF-p, activin or BMP type I 
receptor. The isolated polypeptide in certain embodiments binds to a polypeptide encoded by the 
isolated nucleic acid molecule of any of claims 1, 2, 3, or 4. Preferred isolated polypeptides bind 
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to an epitope defined by a polypeptide consisting of the amino acid sequence of SEQ ID N0:2. 
In other preferred embodiments, isolated binding polypeptides include antibodies and fragments 
of antibodies (e.g., Fab, F(ab)2, Fd and antibody fragments which include a CDR3 region which 
binds selectively to the Smad6 polypeptides of the invention). In still other preferred 
embodiments, the isolated polypeptide is a monoclonal antibody, a humanized antibody or a 
chimeric antibody. 

The invention provides in another aspect an isolated complex of polypeptides. The 
isolated complex includes a TGl'-p superfamily receptor or receptor complex bound to a 
polypeptide as claimed in claim 16. Preferably the isolated complex comprises a polypeptide 
having the amino acid sequence of SEQ ID N0:2. In other preferred embodiments, the receptor 
or receptor complex is selected from the group consisting of TpRl, BMPR-IA, BMPR-IB, ActR- 
lA, a complex of TpRI and TpRII, a complex of BMPR-IA and BMPR-II, a complex of BMPR- 
IB and BMPR4I, a complex of ActR-IA and BMPR-II and a complex of ActR-IA and ActR-IL 

According to still another aspect of the invention, methods for reducing TGF-P 
superfamily signal transduction in a mammalian cell are provided. The methods involve 
contacting a mammalian cell with an amount of an inhibitor of TGF-p superfamily signal 
transduction effective to reduce such signal transduction in the mammalian cell. Preferably the 
TGF-P superfamily signal transduction is mediated by a TGPP superfamily ligand, particularly 
TGF-P 1, activin, Vgl, BMP-4 and/or BMP-7. Other methods arc provided for modulating 
phosphorylation of pathway specific Smads (e.g. Smadl, Smad2, Smad3 and/or Smad5). Certain 
methods are provided for reducing phosphorylation of Smadl or Smad2 in a mammalian cell by 
contacting the mammalian cell with an agent which reduces Smadl or Smad2 phosphorylation, 
respectively. Still other methods are provided for increasing phosphorylation of Smad3 in a 
mammahan cell by contacting the mammalian cell with an agent which increases Smad3 
phosphorylation. In certain embodiments of the foregoing methods, the agent is an isolated 
Smad6 polypeptide, such as a polypeptide encoded by a nucleic acid which hybridizes under 
stringent conditions or the nucleic acid of SEQ ID NO: 1 , or degenerates or complements thereof 

According to still another aspect of the invention, methods for modulating proliferation 
and/or differentiation of a cancer cell are provided. Tlie methods involve contacting a cancer cell 
with an amount of an isolated Smad6 polypeptide as described above, effective to reduce 
proliferation and/or differentiation of the cancer cell. 

The invention in a further aspect provides methods for increasing TGF-p superlamily 
signal transduction in a mammalian cell. The mammalian cell is contacted with an agent that 
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selectively binds to an isolated nucleic acid molecule of the invention or an expression product 
thereof in an amount effective to increase TGF-P superfamily signal transduction. Preferably the 
TGF-p superfamily signal transduction is mediated by a TGpp superfamily ligand selected from 
the group consisting of TGF-p 1, activin, Vgl , BMP -4 and BMP-7. Preferred agents are 
antisense Smad6 nucleic acids, including modified nucleic acids, and polypeptides including 
antibodies which bind to a Smad6 polypeptide including the amino acids of SEQ ID N0;2, and a 
dominant negative variant of the polypeptide of SEQ ID N0:2. 

The invention in still another aspect provides compositions comprising a Smad6 
polypeptide and a pharmaceutically acceptable carrier. 

The invention in a further aspect involves a method for decreasing Smad6 TGF-P 
superfamily inhibitory activity in a subject. An agent that selectively binds to an isolated nucleic 
acid molecule of the invention or an expression product thereof is administered to a subject in 
need of such treatment, in an amount effective to decrease TGpp superfamily signal transduction 
inhibitory activity of Smad7 in the subject. Preferably the TGpp superfamily signal transduction 
is mediated by a TGFp superfamily ligand selected from the group consisting of 1 GF-PU 
activin, Vgl, BMP-4 and BMP-7. Preferred agents are antisense nucleic acids, including 
modified nucleic acids, and polypeptides including antibodies which bind to the polypeptide 
including the amino acids of SEQ ID N0:2, and dominant negative variants of the polypeptide of 
SEQ ID N0:2. 

According to yet another aspect of the invention, methods for treating a condition 
characterized by abnormal BMP activity are provided. The methods include administering to a 
subject in need of such treatment an effective amount of Smad6 or a Smad6 agonist or antagonist 
sufficient to restore the BMP activity to normal. In some embodiments, the condition is selected 
from the group consisting of ossification of the posterior longitudinal ligament and ossification 
of the ligament flavum. 

According to another aspect of the invention, methods for treating a condhion 
characterized by abnormal TGF-P activity arc provided. The methods include administering to a 
subject in need of such treatment an effective amount of Smad6 or a Sniad6 agonist or antagonist 
sufficient to restore the TGF-p activity to normal. In certain embodiments, the condition is 
selected from the group consisting of liver fibrosis including cirrhosis and veno-occlusivc 
disease; kidney fibrosis including glomerulonephritis, diabetic nephropatiiy, allograft rejection 
and HIV nephropathy; lung fibrosis including idiopathic fibrosis and autoimmune fibrosis; skin 
fibrosis including systemic sclerosis, keloids, hypertrophic burn scars and eosinophilia-myalgia 
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syndrome; arterial fibrosis including vascular restenosis and atherosclerosis; central ner\^ous 
system fibrosis including intraocular fibrosis; and other fibrotic diseases including rheumatoid 
arthritis and nasal polyposis. 

In another aspect of the invention, methods for modulating the expression of cyclin A are 
provided. The methods include contacting a cell with Smad6 or an agonist or antagonist thereof, 
in an amount effective to modulate the expression of cyclin A. In some embodiments the cell is 
contacted with Smad6 or an agonist thereof, and the expression of cyclin A is increased. In other 
embodiments, the cell is contacted with an antagonist of Smad6, and the expression of cyclin A 
is decreased. Preferably the antagonist of Smad6 is selected from the group consisting of 
antibodies to Smad6, dominant negative variants of Smad6 and Smad6 anlisense nucleic acids. 

According to another aspect of the invention, methods are provided for identifying lead , 
compounds for a pharmacological agent useful in the diagnosis or treatment of disease associated 
with Smad6 TGF-P superfamily signal transduction inhibitory activity. One set oi' methods 
involves forming a mixture of a Smad6 polypeptide, a TGF-P superfamily receptor or receptor 
complex, and a candidate pharmacological agent. The mixture is incubated under conditions 
which, in the absence of the candidate pharmacological agent, permit a first amount of specific 
binding of the TGF-|3 superfamily receptor or receptor complex by the Smad6 polypeptide. A 
test amount of the specific binding of the TGF-p superfamily receptor or receptor complex by the 
Smad6 polypeptide then is detected. Detection of an increase in the foregoing activity in the 
presence of the candidate pharmacological agent indicates that the candidate pharmacological 
agent is a lead compound for a pharmacological agent which increases the Smad6 TGF-p 
superfamily signal transduction inhibitory activity. Detection of a decrease in the foregoing 
activities in the presence of the candidate pharmacological agent indicates that the candidate 
pharmacological agent is a lead compoimd for a pharmacological agent which decreases Smad6 
TGF-p superfamily signal transduction inhibitory activity. Another set of methods involves 
forming a mixture as above, adding further a pathway specific Smad polypeptide, and detecting 
first and test amounts of TGF-P superfamily induced phosphorylation of the pathway specific 
Smad polypeptide. Detection of an increase in the phosphorylation in the presence of the 
candidate pharmacological agent indicates that the candidate pharmacological agent is a lead 
compound for a pharmacological agent which decreases the Smad6 TGF-p superfamily signal 
transduction inhibitory activity. Detection of a decrease in the foregoing activities in the 
presence of the candidate pharmacological agent indicates that the candidate pharmacological 
agent is a lead compound for a pharmacological agent which increases Smad6 TGF-p 
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superfamily signal transduction inhibitory activity. Preferred Smad6 polypeptides include the 
polypeptides of claim 1 6. Preferably the TGpp superfamily receptor is selected from the group 
consisting of TGPP superfamily type I receptors, TGpp superfamily type II receptors, and 
complexes of TGFp superfamily type I receptors and TGPp superfamily type II receptors- 
Preferred pathway specific Smad polypeptides include Smadl and Smad2. 

According to still another aspect of the invention, methods for increasing phosphorylation 
of Smad3 in a mammalian cell are provided. The methods include contacting the mammalian 
cell with an amount of an isolated Smad6 polypeptide effective to increase phosphorylation of 
Smad3 in the mammalian cell. 

According to another aspect of the invention, methods for reducing hetcromerization of 
Smad2 with Smad3 or Smad4 in a mammalian cell are provided. The methods include 
contacting the mammalian cell with an amount of an isolated Smad6 nucleic acid or polypeptide, 
or an agonist thereof, effective to reduce hetcromerization of Smad2 with Smad3 or Sniad4 in the 
manmialian cell. 

The use of the foregoing compositions, nucleic acids and polypeptides in the preparation 
of medicaments also is provided. 

In the foregoing compositions and methods, preferred members of the TGF-P 
superfamily are TGF-pl, activin, Vgl, BMP-4 and BMP-7, and the preferred pathway specific 
Smad polypeptides are Smadl, Smad2, Smad3 and SmadS, 

These and other objects of the invention will be described in further detail in comiection 
with the detailed description of the invention. 

Brief Description of the Figures 

Fig. 1 is a representation of a photograph which depicts (A) the protein sequence 
alignments of Smad6 (SEQ ID N0:2) with Smads 1-5 (SFQ ID NOs:6-]0) and (B) the tissue 
distribution of mouse and human Smad6 in human (left) and a mouse (right) tissue blots. 

Fig. 2 is a representation of a photograph which shows the binding of Smad6 to type I 
receptors; (A) TpR-I, (B) ActR-IB, (C) BMPR-IB, 

Fig. 3 (panels A-D) is a representation of a photograph which demonstrates the effect of 
Smad6 on the phosphorylation of Smadl, Smad 2 and Smad3. 

Fig. 4 (panels A-C) is a representation of a photograph which demonstrates the effect of 
Smad6 on the hetcromerization of Smad2, Smad3 and Smad4- 

Fig. 5 (panels A-C) is a representation of a photograph which shows the effect of Smad6 
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on transcriptional responses of TGF-p. 

Fig. 6 (panels A-C) is a representation of a photograph which shows the expression of 
Sniad6 following stimulation of cells with BMP-2 (A), BMP-7/0P-1 (B) or TGF-pi (C). 

Brief Description of the Sequences 

SEQ ID N0:1 is the nucleotide sequence of the mouse Smad6 cDNA. 

SEQ ID N0:2 is the amino acid sequence of the mouse Smad6 protein. 

SEQ ID N0:3 is the nucleotide sequence of the coding region of the mouse Smad6 

cDNA. 

SEQ ID N0:4 is the nucleotide sequence of the Smad6-relatcd cDNA having GenBank 
accession number U59914. 

SEQ ID N0:5 is the amino acid sequence of the Smad6-related protein having GenBank 
accession number U59914. 

SEQ ID N0:6 is the amino acid sequence of the Smadl protein, shown in Fig.l . 

SEQ ID NO: 7 is the amino acid sequence of the Smad2 protein, shown in Fig.l. 

SEQ ID N0:8 is the amino acid sequence of the Smad3 protein, shown in Fig.L 

SEQ ID N0:9 is the amino acid sequence of the Smad4 protein, shown in Fig.l. 

SEQ ID NO: 10 is the amino acid sequence of the SmadS protein, shown in Fig.l . 

Detailed Description of the Invention 

It has previously been shown that Smadl, Smad2, Smad3, and Smad5 transduce ligand- 
specific signals. In addition, Smad4 acts as an essential common partner of these ligand- 
specific Smads. In the present work, Smad6 is reported as a member belonging to a third 
class of the Smad family. Smad6 has a quite distinct structure from the Smadl, Smad2, 
Smad3, Smad4 and Smad5, and is believed to be a negative regulator in signaling of the TGF- 
p superfamily. Although not wishing to be bound by a precise mechanism, it is believed that 
the regulatory step by Smad6 is at the receptor level since Smad6 stably associates with the 
type I receptors. 

The present invention in one aspect involves the cloning of a cDNA encoding a Smad6 
protein which interacts with TGF-P superfamily receptors. The TGF-p superfamily members are 
well known to those of ordinary skill in the art and include TGF-Ps, activins, bone 
morphogenetic proteins (BMPs), Vgl, Mullerian inhibitory substance (MIS) and 
growth/differentiation factors (GDFs). The sequence of the mouse Smad6 gene is presented as 
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SEQ ID NO:K and the predicted amino acid sequence of this gene's protein product is presented 
as SEQ ID N0:2. Analysis of the sequence by comparison to nucleic acid and protein databases 
determined that Smad6 has a C-termtnal domain (the MH2 domain) which is related to other 
Smad proteins (Fig.l). 

The invention thus involves in one aspect Smad6 polypeptides, genes encoding those 
polypeptides, functional modifications and variants of the foregoing, useful fragments of the 
foregoing, as well as therapeutics relating thereto. 

Homologs and alleles of the Smad6 nucleic acids of the invention can be identified by 
conventional teclmiques. For example, the human homolog of Smad6 can be isolated by 
hybridizing a probe derived from SEQ ID N0:1 under stringent conditions a human cDNA 
library and selecting positive clones. The existence, size, and tissue distribution of a human 
homolog is demonstrated in the examples by Northern blot. Thus, an aspect of the invention is 
those nucleic acid sequences which code for Smad6 polypeptides and which hybridize to a 
nucleic acid molecule consisting of SEQ ID N0:1, under stringent conditions. The term 
"stringent conditions** as used herein refers to parameters with which the art is familiar. Nucleic 
acid hybridization parameters may be found in references which compile such methods, e.g. 
Molecular Cloning: A Laboratory Manual, J. Sambrook, et al., eds., Second Edition, Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1989, or Current Protocols in 
Molecular Biology, P.M. Ausubel, et al., eds., John Wiley & Sons, Inc., New York. More 
specifically, stringent conditions, as used herein, refers, for example, to hybridization at 65T in 
hybridization buffer (3.5 x SSC, 0.02% Ficoll, 0.02% polyvinyl pyrrolidone, 0.02% Bovine 
Serum Albumm, 2.5mM NaH2P04(pH7), 0.5% SDS, 2mM EDTA). SSC is 0.1 5M sodium 
chloride/0. 15M sodium citrate, pH7; SDS is sodium dodecyl sulphate; and EDTA is 
cthylenediaminetetracetic acid. After hybridization, the membrane upon which the DNA is 
transferred is washed at 2 x SSC at room temperature and then at 0. 1 x SSC/0. 1 x SDS al 
temperatures up to SS^C. 

There are other conditions, reagents, and so forth which can used, which result in a 
similar degree of stringency. The skilled artisan will he familiar with such conditions, and thus 
they are not given here. It will be understood, however, that the skilled artisan will be able to 
manipulate the conditions in a manner to permh the clear identification ol' homologs and alleles 
of Smad6 nucleic acids of the invention. The skilled artisan also is familiar with the 
methodology for screening cells and libraries for expression of such molec ules which then arc 
routinely isolated, followed by isolation of the pertinent nucleic acid mo^e :i le and sequencing. 
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In general, homologs and alleles typically will share at least 40% nucleotide identity 
and/or at least 50% amino acid identity to SEQ ID NO: I and SEQ ID N0:2, respectively, in 
some instances will share at least 50% nucleotide identity and/or at least 65% amino acid identity 
and in still other instances will share at least 60% nucleotide identity and/or at least 75% amino 
acid identity. Watson-Crick complements of the foregoing nucleic acids also are embraced by 
the invention. 

In screening for Smad6 proteins, a Southern blot may be performed using the foregoing 
conditions, together with a radioactive probe. After washing the membrane to which the DNA is 
finally transferred, the membrane can be placed against X-ray film lo delect the radioactive 
signal. 

The invention also includes degenerate nucleic acids which include alternative codons to 
those present in the native materials. For example, serine residues are encoded by the codons ; 
TCA, AGT, TCC, TCG, TCT and AGC. Each of the six codons is equivalent for the purposes of 
encoding a serine residue. Thus, it will be apparent to one of ordinary skill in the art that any of 
the serine-encoding nucleotide triplets may be employed to direct the protein synthesis apparatus, 
in vitro or in vivo, to incorporate a serine residue into an elongating Smad6 polypeptide. 
Similarly, nucleotide sequence triplets which encode other amino acid residues include, but are 
not limited to: CCA, CCC, CCG and CCT (proline codons); CGA, CGC, CGG, CGT, AGA and 
AGG (arginine codons); ACA, ACC, ACG and ACT (threonine codons); AAC and AAT 
(asparaginc codons); and ATA, ATC and ATT (isoleucine codons). Other amino acid residues 
may be encoded similarly by multiple nucleotide sequences. Thus, the invention embraces 
degenerate nucleic acids that differ from the biologically isolated nucleic acids in codon 
sequence due to the degeneracy of the genetic code. 

The invention also provides isolated unique fragments of SEQ ID N0:1 or complements 
of SEQ ID N0:1 . A unique fragment is one that is a 'signature' for the larger nucleic acid. It, 
for example, is long enough to assure that its precise sequence is not found in molecules outside 
of the Smad6 nucleic acids defined above. Unique fragments can be used as probes in Southern 
blot assays to identify such nucleic acids, or can be used in amplification assays such as tho.se 
employing PGR. As known to those skilled in the art, large probes such as 200 nucleotides or 
more are preferred for certain uses such as Southern blots, while smaller fragments will be 
preferred for uses such as PCR. Unique fragments also can be used to produce fusion proteins 
for generating antibodies or determining binding of the polypeptide fragments, as demonstrated 
in the Examples, or for generating immunoassay components. Likewise, unique fragments can 
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be employed to produce nonfused fragments of the Smad6 polypeptides, useful, for example, in 
the preparation of antibodies, in immunoassays, and as a competitive binding partner of the 
TGF-P receptor, activin receptor or BMP receptor and/or other polypeptides which hind to the 
Sraad6 polypeptides, for example, in therapeutic applications. Unique fragments further can be 
used as antisense molecules to inhibit the expression of Smad6 nucleic acids and polypeptides, 
particularly for therapeutic purposes as described in greater detail below. 

As will be recognized by those skilled in the art, the size of the unique fragment will 
depend upon its conservancy in the genetic code. Thus, some regions of SEQ ID NO: 1 and its 
complement will require longer segments to be unique while others will require only short 
segments, typically between 12 and 32 nucleotides (e.g. 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 
23, 24, 25, 26, 27, 28, 29, 30, 31 and 32 bases long). Excluding nucleic acid molecules 
consisting completely of the nucleotide sequence of SEQ ID N0:4 or fragments thereof 
(GenBank accession number U59914) which overlap SEQ ID NO: I , virtually any segment of 
SEQ ID NO: 1 , or complements thereof, that is 1 8 or more nucleotides in length will be unique. 
A fragment which is completely composed of the sequence of SEQ ID N0:4 or fragments 
thereof is one which does not include any of the nucleotides unique to Smad6. Preferred longer 
unique fragments include those which are at least 35, 40, 45, 50, 55, 60, 65, 70, 75, 100, 150, 
200, 250, 300, or 500 nucleotide in length. Particularly preferred are those unique fragments 
drawn completely from the portion of SEQ ID N0:3 which is not overlapped by SEQ ID N0:4. 

The unique fragments of the invention exclude sequences identical with certain particular 
prior art nucleic acids or that are identical to only fragments thereof. It is intended that the 
claims not embrace such molecules which are in the prior art. For example, portions of prior art 
ESTs having GenBank accession numbers AA451501, AA046702, W72479, AAI3I352, 
AA 13 1266, W41 1 1 1, N48277, and the like, which are identical to the Smad6 sequence of the 
invention are not unique fragments of Smad6, Thus, the nucleic acids which consist only of 
these sequences, or which consist only of fragments of these sequences, are considered to be 
within the prior art. Nucleic acids, however, which include any portion of the novel sequence of 
the invention are embraced by the invention, including sequences comprising contiguous 
portions of the novel sequences and the prior art sequences. Such sequences have unexpected 
properties, as described herein. In one embodiment the unique fragment does not include any 
portion of the excluded prior art sequences. Those skilled in the art are well versed in methods 
for selecting such sequences, typically on the basis of the ability of the unique fragment to 
selectively distinguish the sequence of interest from non-Smad6 nucleic acids. A comparison of 
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the sequence of the fragment to those on known data bases typically is all that is necessary, 
although in vitro confirmatory hybridization and sequencing analysis may be performed. 

A unique fragment can be a frinctional fragment. A functional fragment of a nucleic acid 
molecule of the invention is a fragment which retains some functional property of the larger 
nucleic acid molecule, such as coding for a functional polypeptide, binding to proteins, 
regulating transcription of operably linked nucleic acids, and the like. One of ordinary skill in 
the art can readily determine using the assays described herein and those well known in the art to 
determine whether a fragment is a functional fragment of a nucleic acid molecule using no more 
than routine experimentation. 

As mentioned above, the invention embraces antisense oligonucleotides that selectively 
bind to a nucleic acid molecule encoding a Smad6 polypeptide, to increase TGF-P supcrfamily 
signalling by reducing the amount of Smad6. This is desirable in virtually any medical condition 
wherein a reduction of Smad6 is desirable, e,g., to increase TGF-p signalling. 

As used herein, the term "antisense oligonucleotide" or "antisense" describes an 
oligonucleotide that is an oligoribonucleotide, oligodeoxyribonucleotide, modified 
oligoribonucleotide, or modified oligodeoxyribonucleotide which hybridizes under physiological 
conditions to DNA comprising a particular gene or to an mRNA transcript of that gene and, 
thereby, inhibits the transcription of that gene and/or the translation of that mRNA. The 
antisense molecules are designed so as to interfere with transcription or translation of a target 
gene upon hybridization with the target gene or transcript. Those skilled in the art will recognize 
that the exact length of the antisense oligonucleotide and its degree of complementarity with its 
target will depend upon the specific target selected, including the sequence of the target and the 
particular bases which comprise that sequence. It is preferred that the antisense oligonucleotide 
be constructed and arranged so as to bind selectively with the target under physiological 
conditions, i.e., to hybridize substantially more to the target sequence than to any other sequence 
in the target cell under physiological conditions. Based upon SEQ ID NO: 1 , or upon allelic or 
homologous genomic and/or cDNA sequences, one of skill in the art can easily choose and 
synthesize any of a number of appropriate antisense molecules for use in accordance with the 
present invention. In order to be sufficiently selective and potent for inhibition, such antisense 
oligonucleotides should comprise at least 1 0 and, more preferably, at least 1 5 consecutive bases 
which are complementary to the target, although in certain cases modified oligonucleotides as 
short as 7 bases in length have been used successfully as antisense oligonucleotides (Wagner et 
Nature Biotechnol 14:840-844, 1996). Most preferably, the antisense oligonucleotides 
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comprise a complementary sequence of 20-30 bases. Although oligonucleotides may be chosen 
which are antisense to any region of the gene or mRN A transcripts, in preferred embodiments the 
antisense oligonucleotides correspond to N-terminal or 5' upstream sites such as translation 
initiation, transcription initiation or promoter sites. In addition, 3'-untranslated regions may be 
targeted. Targeting to mRNA splicing sites has also been used in the art but may be less 
preferred if alternative mRNA splicing occurs. In addition, the antisense is targeted, preferably, 
to sites in which mRNA secondary structure is not expected (see, e.g., Sainio et al., Cell Mol. 
Neurobioi 14(5):439-457, 1994) and at which proteins are not expected to bind. Finally, 
although SEQ ID NO: I discloses a cDNA sequence, one of ordinary skill in the art may easily 
derive the genomic DNA corresponding to the cDNA of SEQ ID NO: 1 , Thus, the present 
invention also provides for antisense oligonucleotides which are complementary to the genomic 
DNA corresponding to SEQ ID N0:1. Similarly, antisense to allelic or homologous cDNAs and 
genomic DNAs are enabled without undue experimentation. 

In one set of embodiments, the antisense oligonucleotides of the invention may be 
composed of "natural" deoxyribonucleotides, ribonucleotides, or any combination thereof Tliat 
is, the 5' end of one native nucleotide and the 3' end of another native nucleotide may be 
covalently linked, as in natural systems, via a phosphodiester intemucleoside linkage. These 
oligonucleotides may be prepared by art recognized methods which may be carried out manually 
or by an automated synthesizer. They also may be produced recombinantly by vectors. 

In preferred embodiments, however, the antisense oligonucleotides of the invention also 
may include "modified" oligonucleotides. That is, the oligonucleotides may be modified in a 
number of ways which do not prevent them from hybridizing to their target but which enhance 
their stability or targeting or which otherwise enhance their therapeutic effectiveness. 

The term '^modified oligonucleotide" as used herein describes an oligonucleotide in 
which (I) at least two of its nucleotides are covalently linked via a synthetic intemucleoside 
linkage (i.e., a linkage other than a phosphodiester linkage between the 5' end of one nucleotide 
and the 3' end of another nucleotide) and/or (2) a chemical group not normally associated with 
nucleic acids has been covalently attached to the oligonucleotide. Preferred synthetic 
intemucleoside linkages are phosphorothioates, alkylphosphonates, phosphorodithioates, 
phosphate esters, alkylphosphonothioates, phosphoramidates, carbamates, carbonates, phosphate 
triesters, acetamidates, carboxymethyl esters and peptides. 

The term ''modified oligonucleotide" also encompasses oligonucleotides with a 
covalently modified base and/or sugar. For example, modified oligonucleotides include 
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oligonucleotides having backbone sugars which are covalently allached to low molecular weight 
organic groups other than a hydroxyl group at the 3' position and other than a phosphate group at 
the 5' position. Thus modified oligonucleotides may include a 2'-0-alkylated ribosc group. In 
addition, modified oligonucleotides may include sugars such as arabinose instead of ribose. The 
5 present invention, thus, contemplates pharmaceutical preparations containing modified antisensc 
molecules that are complementary to and hybridizable with, under physiological conditions, 
nucleic acids encoding Smad6 polypeptides, together with pharmaceutically acceptable carriers. 

Antisense oligonucleotides may be administered as part of a pharmaceutical composition. 
Such a pharmaceutical composition may include the antisense oligonucleotides in combination 

10 with any standard physiologically and/or pharmaceutically acceptable carriers which are known 
in the art. The compositions should be sterile and contain a therapeutically effective amount of 
the antisense oligonucleotides in a unit of weight or volume suitable for administration to a 
patient. The term "pharmaceutically acceptable" means a non-toxic material that does not 
interfere with the effectiveness of the biological activity of the active ingredients. The term 

15 "physiologically acceptable" refers to a non-toxic material that is compatible with a biological 
system such as a cell, cell culture, tissue, or organism. The characteristics of the carrier will 
depend on the route of administration. Physiologically and pharmaceutically acceptable carriers 
include diluents, fillers, salts, buffers, stabilizers, solubilizers, and other materials which are well 
known in the art. 

20 As used herein, a "vector" may be any of a number of nucleic acids into which a desired 

sequence may be inserted by restriction and ligation for transport between different genetic 
environments or for expression in a host cell. Vectors are typically composed of DNA although 
RNA vectors are also available. Vectors include, but are not limited to, plasmids, phagemids and 
virus genomes. A cloning vector is one which is able to replicate in a host cell, and which is 

25 further characterized by one or more endonuclease restriction sites at which the vector may be 
cut in a determinable fashion and into which a desired DNA sequence may be ligated such that 
the new recombinant vector retains its ability to replicate in the host cell. In the case of 
plasmids, replication of the desired sequence may occur many times as the plasmid increases in 
copy number within the host bacterium or just a single time per host before the host reproduces 

30 by mitosis. In the case of phage, replication may occur actively during a lytic phase or passively 
during a lysogenic phase. An expression vector is one into which a desired DNA sequence may 
be inserted by restriction and ligation such that it is opcrably joined to regulatory sequences and 
may be expressed as an RNA transcript. Vectors may further contain one or more marker 
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sequences suitable for use in the identification of cells which have or have not been transformed 
or transfected with the vector. Markers include, for example, genes encoding proteins which 
increase or decrease either resistance or sensitivity to antibiotics or other compounds, genes 
which encode enzymes whose activities arc detectable by standard assays known in the art (e.g., 
5 luciferase, 6-galactosidase or alkaline phosphatase), and genes which visibly affect the 

phenotype of transformed or transfected cells, hosts, colonies or plaques (e.g., green fluorescent 
protein). Preferred vectors are those capable of autonomous replication and expression of the 
structural gene products present in the DNA segments to which they are operably joined. 

As used herein, a coding sequence and regulatory sequences are said lo be ^'operably'' 

10 joined when they are covalently linked in such a way as to place the expression or transcription 
of the coding sequence under the influence or control of the regulatory sequences. If it is desired 
that the coding sequences be translated into a functional protein, two DNA sequences are said to 
be operably joined if induction of a promoter in the 5' regulatory sequences results in the 
transcription of the coding sequence and if the nature of the linkage between the two DNA 

15 sequences does not (1) resuh in the introduction of a frame-shift mutation, (2) interfere with the 
ability of the promoter region to direct the transcription of the coding sequences, or (3) interfere 
with the ability of the corresponding RNA transcript to be translated into a protein. Thus, a 
promoter region would be operably joined to a coding sequence if the promoter region were 
capable of effecting transcription of that DNA sequence such that the resulting transcript might 

20 be translated into the desired protein or polypeptide. 

The precise nature of the regulatory sequences needed for gene expression may vary 
between species or cell types, but shall in general include, as necessary, 5' non-transcribed and 5' 
non-translated sequences involved with the initiation of transcription and translation respectively, 
such as a TATA box, capping sequence, CAAT sequence, and the like. Such 5' non-transcribed 

25 regulator)' sequences especially will include a promoter region which includes a promoter 

sequence for transcriptional control of the operably joined gene. Regulatory sequences may also 
include enhancer sequences or upstream activator sequences as desired. The vectors of the 
invention may optionally include 5' leader or signal sequences. The choice and design of an 
appropriate vector is within the ability and discretion of one of ordinary skill in the art. 

30 Expression vectors containing all the necessary elements for expression are commercially 

available and known to those skilled in the art. See, e.g., Sambrook et al., Molecular Cloning: A 
Laboratory Manual, Second Edition, Cold Spring Harbor Laborator>' Press, 19S9. Cells are 
genetically engineered by the introduction into the cells of heterologous DNA (RNA) encoding 
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Smad6 polypeptide or fragment or variant thereof. That heterologous DNA (RNA) is placed 
under operable control of transcriptional elements to permit the expression of the heterologous 
DNA in the host cell. 

Preferred systems for mRNA expression in mammalian cells are those such as pRc/CMV 
(available from Invitrogen, Carlsbad, CA) that contain a selectable marker such as a gene that 
confers G41 8 resistance (which facilitates the selection of stably transfected cell lines) and the 
human cytomegalovirus (CMV) enhancer-promoter sequences. Additionally, suitable for 
expression in primate or canine cell lines is the pCElM vector (Invitrogen), which contains an 
Epstein Barr virus (EBV) origin of repHcation, facilitating the maintenance of plasmid as a 
multicopy extrachromosomal element. Another expression vector is the pEF-BOS plasmid 
containing the promoter of polypeptide Elongation Factor la, which stimulates efficiently 
transcription in vitro. The plasmid is described by Mishizuma and Nagata {Nuc. Acids Res. ' 
18:5322, 1990), and its use in transfection experiments is disclosed by, for example, Demoulin 
{Moi Cell. Biol. 16:4710-4716, 1996). Still another preferred expression vector is an 
adenovirus, described by Stratford-Perricaudet, which is defective for El and E3 proteins {J. 
Clin. Invest. 90:626-630, 1992). The use of the adenovirus as an Adcno.Pl A recombinant is 
disclosed by Wamier et al., in intradermal injection in mice for immunization against PI A {Int. ./ 
Cancer, 67:303-310, 1996). 

The invention also embraces so-called expression kits, which allow the artisan to prepare 
a desired expression vector or vectors. Such expression kits include at least separate portions of 
each of the previously discussed coding sequences. Other components may be added, as desired, 
as long as the previously mentioned sequences, which are required, are included. 

The invention also permits the construction of Smad6 gene transgenics and "knock-outs" 
in cells and in animals, providing materials for studying certain aspects of TGF-P superfamily 
signal transduction and the effects thereof on cellular, developmental and physiological 
processes. 

The invention also provides isolated polypeptides, which include the polypeptide of SEQ 
ID NO:2 and unique fragments of SEQ ID N0:2. Smad6 polypeptides are encoded by nucleic 
acids described above, e.g., those which hybridize to SEQ ID NO: 1 . Such polypeptides are 
useful, for example, alone or as fusion proteins to generate antibodies, as a components of an 
immunoassay. 

A unique fragment of a Smad6 polypeptide, in general has the features and 
characteristics of unique fragments as discussed above in connection with nucleic acids. As will 
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be recognized by those skilled in the art, the size of the unique fragment will depend upon factors 
such as whether the fragment constitutes a portion of a conserved protein domain. Thus, some 
regions of SHQ ID N0:2 will require longer segments to he unique (e.g., 20, 30, 50, 75, and 100 
amino acid long) while others will require only short segments, typically between 5 and 12 
amino acids (e.g. 5, 6, 7, 8, 9, 10, II and 12 amino acids long). Virtually any segment of SEQ 
ID N0:2 which is not overlapped by SEQ ID N0:5, and that is 10 or more amino acids m length 
will be unique. A unique fragment of a Smad6 polypeptide excludes fragments completely 
composed of the amino acid sequence of SEQ ID N0;5 which overlaps SEQ ID N0:2. A 
fragment which is completely composed of the sequence of SEQ ID NO:5 is one which does not 
include any of the amino acids unique to Smad6. 

Unique fragments of a polypeptide preferably are those fragments which retain a distinct 
functional capability of tlie polypeptide. Functional capabilities which can be retained in a ' 
unique fragment of a polypeptide include interaction with antibodies, interaction with other 
polypeptides (such as TGF-P superfamily type I receptors) or fragments thereof, selective 
binding of nucleic acids or proteins, and enzymatic activity. Those skilled in the art are well 
versed in metliods for selecting unique amino acid sequences, typically on the basis of the ability 
of the unique fragment to selectively distinguish the sequence of interest from non-family 
members. A comparison of the sequence of the fragment to those on known data bases typically 
is all that is necessary. 

The invention embraces variants of the Smad6 polypeptides described above. As used 
herein, a "variant" of a Smad6 polypeptide is a polypeptide which contains one or more 
modifications to the primary amino acid sequence of a Smad6 polypeptide. Modifications which 
create a Smad6 variant can be made to a Smad6 polypeptide I) to reduce or eliminate an activity 
of a Smad6 polypeptide, such as binding to TpR-I; 2) to enliance a property of a Sniad6 
polypeptide, such as protein stability in an expression system or the stability of protein-protein 
binding; or 3) to provide a novel activity or property to a Smad6 polypeptide, such as addition of 
an antigenic epitope or addition of a detectable moiety. Modifications to a Smad6 polypeptide 
are typically made to the nucleic acid which encodes the Smad6 polypeptide, and can include 
deletions, point mutations, truncations, amino acid substitutions and additions of amino acids or 
non-amino acid moieties. Alternatively, modifications can be made directly to the polypeptide, 
such as by cleavage, addition of a linker molecule, addition of a detectable moiety, such as 
biotin, addition of a fatty acid, and the like. Modifications also embrace f usion proteins 
comprising all or part of the Smad6 amino acid sequence. 
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In general, variants include Sniad6 polypeptides which are modified specifically to alter a 
feature of the polypeptide unrelated to its physiological activity. For example, cysteine residues 
can be substituted or deleted to prevent unwanted disulfide linkages. Similarly, certain amino 
acids can be changed to enhance expression of a Smad6 polypeptide by eliminating proteolysis 
by proteases in an expression system (e.g., dibeisic amino acid residues in yeast expression 
systems in which KEX2 protease activity is present). 

Mutations of a nucleic acid which encode a Smad6 polypeptide preferably preserve the 
amino acid reading frame of the coding sequence, and preferably do not create regions in the 
nucleic acid which are likely to hybridize to form secondary structures, such a hairpins or loops, 
which can be deleterious to expression of the variant polypeptide. 

Mutations can be made by selecting an amino acid substitution, or by random 
mutagenesis of a selected site in a nucleic acid which encodes the polypeptide. Variant 
polypeptides are then expressed and tested for one or more activities to determine which 
mutation provides a variant polypeptide with the desired properties. Further mutations can be 
made to variants (or to non-variant Smad6 polypeptides) which are silent as to the amino acid 
sequence of the polypeptide, but which provide preferred codons for translation in a particular 
host. The preferred codons for translation of a nucleic acid in, e.g., E. coli. are well known to 
those of ordinary skill in the art. Still other mutations can be made to the noncoding sequences 
of a Smad6 gene or cDNA clone to enhance expression of the polypeptide. The activity of 
variants of Smad6 polypeptides can be tested by cloning the gene encoding the variajit Smad6 
polypeptide into a bacterial or mammalian expression vector, introducing the vector into an 
appropriate host cell, expressing the variant Smad6 polypeptide, and testing for a functional 
capability of the Smad6 polypeptides as disclosed herein. For example, the variant Smad6 
polypeptide can be tested for inhibition of TPR-I (and/or activin or BMP receptor) signalling 
activity as disclosed in the Examples, or for inhibition of Smad 1 or SFnad2 phosphorylation as is 
also disclosed herein. Preparation of other variant polypeptides may favor testing of other 
activities, as will be known to one of ordinary skill in the art. 

The skilled artisan will also realize that conservative amino acid substitutions may he 
made in Smad6 polypeptides to provide functionally equivalent variants of the foregoing 
polypeptides, i,e, variants which retain the functional capabilities of the Smad6 polypeptides. As 
used herein, a ''conservative amino acid substitution" refers to an amino acid substitution which 
docs not alter the relative charge or size characteristics of the protein in wliiob the amino acid 
substitution is made. Variants can be prepared according to methods for alt:;ring polyjiepiide 
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sequence known to one of ordinary skill in the art such as are found in references which compile 
such methods, e.g. Molecular Cloning: A Laboratory Manual, J. Sambrook, et a!., eds., Second 
Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1 989, or Current 
Protocols in Molecular Biology^ F.M. Ausubel, et al., eds., John Wiley & Sons, Inc., New York. 
5 Exemplary functionally equivalent variants of the Smad6 polypeptides include conservative 
amino acid substitutions of SEQ ID N0:2. Conser\'ative substitutions of amino acids include 
substitutions made amongst amino acids within the following groups: (a) M, I, L, V; (b) F, Y, W; 
(c) K, R, H; (d) A, G; (e) S, T; (f) Q, N; and (g) D. 

For example, one can make consei*vative amino acid substitutions to the amino acid 
10 sequence of the Smad6 polypeptide using methods known in the art. Exemplary methods for 
identifying functional variants of binding peptides are provided in a published PCT application 
of Stromingcr and Wuchcrpfcnnig (PCT/US96/03 1 82, describes the identification of variants of 
HLA class 11 binding peptides). The described methods can be used to identify Smad6 variants 
which bind XpR-I or other TGF-p superfamily receptors or receptor complexes. These variants 
15 can be tested, e.g., for improved stability and are useful, inter alia, in regulation of TGF-P 
superfamily signalling. 

Conservative amino-acid substitutions in the amino acid sequence of Smad6 polypeptides 
to produce functionally equivalent variants of Smad6 polypeptides typically are made by 
alteration of the nucleic acid encoding Smad6 polypeptides (SEQ ID NO: 1 ). Such substitutions 
20 can be made by a variety of methods known to one of ordinary skill in the art. For example, 
amino acid substitutions may be made by PCR-directed mutation, site-directed mutagenesis 
according to the method of Kunkel (Kunkel, Proc. Nat. Acad Set. U.S.A. 82: 488-492, 1985), or 
by chemical synthesis of a gene encoding a Smad6 polypeptide. Where amino acid substitutions 
are made to a small unique fragment of a Smad6 polypeptide, such as a XpR-I binding site 
25 peptide, the substitutions can be made by directly synthesizing tlie peptide. The activity of 
functionally equivalent fragments of Smad6 polypeptides can be tested by cloning the gene 
encoding the altered Smad6 polypeptide into a bacterial or mammalian expression vector, 
introducing the vector into an appropriate host cell, expressing the altered Smad6 polypeptide, 
and testing for a functional capability of the Smad6 polypeptides as disclosed herein. Peptides 
30 which are chemically synthesized can be tested directly for function, e.g., for binding to TpR-I. 
The invention as described herein has a number of uses, some of which arc described 
elsewhere herein. First, the invention permits isolation of the Smad6 protein molecules (SEQ ID 
N0:2). A variety of methodologies well-known to the skilled practitioner can be utilized to 
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obtain isolated Smad6 molecules. The polypeptide may be purified from cells which naturally 
produce the polypeptide by chromatographic means or immunological recognition. 
Alternatively, an expression vector may be introduced into cells to cause production of the 
polypeptide. In another method, mllN A transcripts may be microinjected or otherwise 
5 introduced into cells to cause production of the encoded polypeptide. Translation of mRNA in 
cell-free extracts such as the reticulocyte iysate system also may be used to produce polypeptide. 
Those skilled in the art also can readily follow known methods for isolating Smad6 polypeptides. 
These include, but are not limited to, immunochromatography, HPLC, size-exclusion 
cliromatography, ion-exchange chromatography and immune-affinity chromatography. 
10 The invention also makes it possible isolate proteins such as TpR-I, AclR-IB and 

BMPR-IB by the binding of such proteins to Smad6 as disclosed herein. The identification of 
this binding also permits one of skill in the art to block the binding of Smad6 to other proteins,' 
such as TpR-I. For example, binding of such proteins can be affected by introducing into a 
biological system in which the proteins bind (e.g., a cell) a polypeptide including a Smad6 TpR-I 
15 binding site in an amount sufficient to reduce or even block the binding of Smad6 and TpR-I. 
The identification of Smad6 binding to TGF-P superfamily receptors also enables one of skill in 
the art to isolate Smad6 amino acid sequences which bind to such receptors and prepare modified 
proteins, using standard recombinant DNA techniques, which can bind to proteins such as TpR-1, 
ActR-IB and BMPR-IB. For example, when one desires to target a certain protein to a TpR-I 
20 receptor complex, one can prepare a fusion polypeptide of the protein and the Smad6 TpR-1 
binding site. AdditionsJ uses are described further herein. 

The invention also provides, in certain embodiments, "dominant negative" polypeptides 
derived from SEQ ID N0:2. A dominant negative polypeptide is an inactive variant of a protein, 
which, by interacting with the cellular machinery, displaces an active protein from its interaction 
25 with the cellular machinery or competes with the active protein, thereb}' reducing the effect of 
the active protein. For example, a dominant negative receptor which binds a ligand but docs not 
transmit a signal in response to binding of the ligand can reduce the biological effect of 
expression of the ligand. Likewise, a dominant negative catalytically-inactivc kinase which 
interacts normally with target proteins but does not phosphorylate the taiget proteins can reduce 
30 phosphor>'lation of the target proteins in response to a cellular signal. Similariy, a dominant 
negative transcription factor which binds to a promoter site in the control region of a gene but 
does not increase gene transcription can reduce the effect of a normal transcription factor by 
occupying promoter binding sites without increasing transcription. 



wo 98/56913 PCT/US98/12078 

-22- 

The end result of the expression of a dominant negative polypeptide in a cell is a 
reduction in function of active proteins. One of ordinary skill in the art can assess the potential 
for a dominant negative variant of a protein, and using standard mutagenesis techniques to create 
one or more dominant negative variant polypeptides. For example, given the teachings contained 
5 herein of a Smad6 polypeptide, one of ordinary skill in the art can modify the sequence of the 
Smad6 polypeptide by site-specific mutagenesis, scanning mutagenesis, partial gene deletion or 
truncation, and the like. See, e.g., U.S. Patent No. 5,580,723 and Sambrook et al.. Molecular 
Cloning: A Laboratory Manual , Second Edition, Cold Spring Harbor Laboratory Press, 1989, 
The skilled artisan then can test the population of mutagenized polypeptides for diminution in a 

10 selected activity (e.g., Smad6 reduction of TGF-p signalling activity) and/or for retention of such 
an activity. Other similar methods for creating and testing dominant negative variants of a 
protein will be apparent to one of ordinary skill in the art. 

Dominant negative Smad6 proteins include variants in which a portion of the binding site 
of Smad6 for a TGF-P superfamily receptor has been mutated or deleted to reduce or eliminate 

15 Smad6 interaction with the TGF-P receptor complex. Other examples include Smad6 variants in 
which the ability to inhibit phosphorylation of Smadl , Smad2 and/or Smad3 is reduced. One of 
ordinary skill in the art can readily prepare Smad6 variants bearing mutations or deletions in the 
C-terminal domain (e.g., in the MH2 domain) or in the N-terminal domain (e.g., in the 
glycine/glutamic acid residue rich region) and test such variants for a Smad6 activity. 

20 The invention also involves agents such as polypeptides which bind to Smad6 

polypeptides and to complexes of Smad6 polypeptides and binding partners such as TpR-I. Such 
binding agents can be used, for example, in screening assays to detect the presence or absence of 
Smad6 polypeptides and complexes of Smad6 polypeptides and their binding partners and in 
purification protocols to isolate Smad6 polypeptides and complexes of Smad6 polypeptides and 

25 their binding partners. Such agents also can be used to inhibit the native activity of the Smad6 
polypeptides or their binding partners, for example, by binding to such polypeptides, or their 
binding partners or both. 

The invention, therefore, embraces peptide binding agents which, for example, can be 
antibodies or fragments of antibodies having the ability to selectively bind lo Smad6 

30 polypeptides. Antibodies include polyclonal and monoclonal antibodies, prepared according to 
conventional methodology. 

Significantly, as is well-known in the art, only a small portion of an antibody molecule, 
the paratope, is involved in the binding of the antibody to its epitope (see, in general, Clark, 
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W.R. (1986) The Experimental Foundations of Modern Imniunolopv Wiley & Sons, Inc., New 
York; Roitt, I. 0991) Essential Immunology , 7th Ed., Blackwell Scientific Publications, 
Oxford). The pFc' and Fc regions, for example, are effectors of the complement cascade but are 
not involved in antigen binding. An antibody from which the pFc' region has been enzymatically 
cleaved, or which has been produced without the pFc* region, designated an F(ab')2 fragment, 
retains both of the antigen binding sites of an intact antibody. Similarly, an antibody from which 
the Fc region has been enzymatically cleaved, or which has been produced without the Fc region, 
designated an Fab fragment, retains one of the antigen binding sites of an intact antibody 
molecule. Proceeding further. Fab fragments consist of a covalently bound antibody light chain 
and a portion of the antibody heavy chain denoted Fd. The Fd fragments are the major 
determinant of antibody specificity (a single Fd fragment may be associated with up to ten 
different light chains without altering antibody specificity) and Fd fragments retain epitopc- 
binding ability in isolation. 

Within the antigen-binding portion of an antibody, as is well-known in the art, there are 
complementarity determining regions (CDRs), which directly interact with the epitope of the 
antigen, and framework regions (FRs), which maintain the tertiary structure of tlic paratope (see, 
in general, Clark, 1986; Roitt, 1991). In both the heavy chain Fd fragment and the light chain of 
IgG immunoglobulins, there are four framework regions (FRl through FR4) separated 
respectively by three complementarity determining regions (CDRl through CDR3). The CDRs, 
and in particular the CDR3 regions, and more particularly the heavy chain CDR3, arc largely 
responsible for antibody specificity! 

It is now well-established in the art that the non-CDR regions of a mammalian antibody 
may be replaced with similar regions of conspecific or heterospecific antibodies while retaining 
the epitopic specificity of the original antibody. This is most clearly manifested in the 
development and use of "humanized" antibodies in which non-human CDRs are covalently 
joined to human FR and/or Fc/pFc' regions to produce a functional antibody. Thus, for example, 
PCT International Publication Number WO 92/04381 teaches the production and use of 
humanized murine RSV antibodies in which at least a portion of the murine FR regions have 
been replaced by FR regions of human origin. Such antibodies, including fragments of intact 
antibodies with antigen-binding ability, ai'e often referred to as ''chimeric'' antibodies. 

Thus, as will be apparent to one of ordinary skill in the art, the present invention also 
provides for F(ab')2, Fab, Fv and Fd fragments; chimeric antibodies in which the Fc and/or FR 
and/or CDRl and/or CDR2 and/or light chain CDR3 regions have been replaced by homologous 
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human or non-human sequences; chimeric F(ab*)2 fragment antibodies in which the FR and/or 
CDRI and/or CDR2 and/or light chain CDR3 regions have been replaced by homologous human 
or non-human sequences; chimeric Fab fragment antibodies in which the FR and/or CDRI and/or 
CDR2 and/or light chain CDR3 regions have been replaced by homologous human or non- 
human sequences; and chimeric Fd fragment antibodies in which the FR and/or CDRI and/or 
CDR2 regions have been replaced by homologous human or non-human sequences. The present 
invention also includes so-called single chain antibodies. 

Thus, the invention involves polypeptides of numerous size and type that bind 
specifically to Smad6 polypeptides, and complexes of both Smad6 polypeptides and their 
binding partners. These polypeptides may be derived also from sources other than antibody 
technology. For example, such polypeptide binding agents can be provided by degenerate 
peptide libraries which can be readily prepared in solution, in immobilized form or as phage . [- 
display libraries. Combinatorial libraries also can be synthesized of peptides containing one or 
more amino acids. Libraries fiirther can be synthesized of peptoids and non-peptide synthetic 
moieties (e.g., peptidomimetics). 

Phage display can be particularly effective in identifying binding peptides useful 
according to the invention. Briefly, one prepares a phage library (using e.g. ml3, fd, or lambda 
phage), displaying inserts from 4 to about 80 amino acid residues using conventional procedures. 
The inserts may represent, for example, a completely degenerate or biased array. One then can 
select phage-bearing inserts which bind to the Smad6 polypeptide. This process can be repeated 
through several cycles of reselection of phage that bind to the Smad6 polypeptide. Repeated 
rounds lead to enrichment of phage bearing particular sequences. DNA sequence analysis can be 
conducted to identify the sequences of the expressed polypeptides. The minimal linear portion of 
the sequence that binds to the Smad6 polypeptide can be determined. One caji repeat tlie 
procedure using a biased library containing inserts containing part or all of the minimal linear 
portion plus one or more additional degenerate residues upstream or downstream thereof. Yeast 
two-hybrid screening methods also may be used to identify polypeptides that bind to the Smad6 
polypeptides. Thus, the Smadfi polypeptides of the invention, or a fragment thereof, can be used 
to screen peptide libraries, including phage display libraries, to identify and select peptide 
binding partners of the Smad6 polypeptides of the invention. Such molecules can be used, as 
described, for screening assays, for purification protocols, for interfering directly with the 
functioning of Smad6 and for other purposes that will be apparent to those of ordinary skill in the 
art. 
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A Smad6 polypeptide, or a fragment thereof, also can be used to isolate their native 
binding partners, including, e.g., the TGF-P receptor complex. Isolation of such binding partners 
may be performed according' to well-known methods. For example, isolated Smad6 polypeptides 
can be attached to a substrate (e.g,, chromatographic media, such as polystyrene beads, or a 
5 filter), and then a solution suspected of containing the TGF-p receptor complex may be applied 
to the substrate. If a TGF-p receptor complex which can interact with Smad6 polypeptides is 
present in the solution, then it will bind to the substrate-bound Smad6 polypeptide. The TGF-P 
receptor complex then may be isolated. Other proteins which are binding partners for Smad6, 
such as other Smads, and activin or BMP receptor complexes, may be isolated by similar 

1 0 methods without undue experimentation. 

It will also be recognized that the invention embraces the use of the Smad6 cDNA 
sequences in expression vectors, as well as to transfcct host cells and cell lines, be these 
prokaryotic (e,g., E. coli), or eukaryotic (e.g., CHO cells, COS cells, yeast expression systems 
and recombinant baculovirus expression in insect cells). Especially useful are mammalian cells 

15 such as human, mouse, hamster, pig, goat, primate, etc. They may be of a wide variety of tissue 
types, and include primary cells and cell lines. Specific examples include keratinocytes, 
fibroblasts, COS cells, peripheral blood leukocytes, bone marrow stem cells and embryonic stem 
cells. The expression vectors require that the pertinent sequence, i.e., those nucleic acids 
described si4pra, be operably linked to a promoter. 

20 The isolation of the Smad6 gene also makes it possible for the artisan to diagnose a 

disorder characterized by aberrant expression of Smad6. These methods involve determining 
expression of the Smad6 gene, and/or Smad6 polypeptides derived therefrom. In the former 
situation, such determinations can be carried out via any standard nucleic acid determination 
assay, including the polymerase chain reaction, or assaying with labeled hybridization probes. 

25 The invention further provides methods for reducing or increasing TGF-P superfamily 

signal transduction in a cell. Such methods are useftil in vitro for altering the TGF-P signal 
transduction, for example, in testing compounds for potential to block aberrant TGF-p signal 
transduction or increase deficient TGF-P signal transduction, /// vivo^ such methods are useful 
for modulating growth, e.g., to treat cancer and fibrosis. Increasing TGF-P signal transduction in 

30 a cell by, e.g., introducing a dominant negative Smad6 polypeptide or Smad6 antisense 
oligonucleotides in the cell, can be used to provide a model system for testing the effects of 
putative inhibitors of TGF-P signal transduction. Such methods also are useful in the treatment 
of conditions which result from excessive or deficient TGF-p signal transduction. TGF-P signal 
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transduction can be measured by a variety of ways known to one of ordinary skill in the art, such 
as the reporter systems described in the Examples. Various modulators of Smad6 activity can be 
screened for effects on TGF-p signal transduction using the methods disclosed herein. The 
skilled artisan can first determine the modulation of a Smad6 activity, such as TGF-(i signalling 
5 activity, and then apply such a modulator to a target cell or subject and assess the effect on the 
target cell or subject. For example, in screening for modulators of Smad6 useful in the treatment 
of cancer, cells in culture can be contacted with Smad6 modulators and the increase or decrease 
of growth or focus formation of the cells can be determined according to standard procedures. 
Smad6 activity modulators can be assessed for their effects on other TGF-P signal transduction 
10 downstream effects by similar methods in many cell types,. 

Thus it can be of therapeutic benefit to administer Smad6 protein or nucleic acid 
encoding a Smad6 protein, or an agonist or antagonist of Smad6, to modulate TGF-p superfainily 
activity in certain conditions characterized by abnormal TGF-P superfamily activity. Specific 
examples of conditions involving abnormally elevated BMP activity include ossification of the 
15 posterior longitudinal ligament (Yonemori et aUAm. J. Pathol. 150:1335-1347, 1997) and 

ossification of the ligament flavum (Hayashi et al.. Bone 21 :23-30, 1 997). Specific examples of 
conditions involving abnormal TGF-p activity include liver fibrosis including cirrhosis and 
veno-occlusive disease; kidney fibrosis including glomerulonephritis, diabetic nephropathy, 
allograft rejection and HIV nephropathy; lung fibrosis including idiopathic fibrosis and 
20 autoimmune fibrosis; skin fibrosis including systemic sclerosis, keloids, hypertrophic burn scars 
and eosinophilia-myalgia syndrome; arterial fibrosis including vascular restenosis and 
atherosclerosis; central nervous system fibrosis including intraocular fibrosis; and other fibrotic 
diseases including rheumatoid arthritis and nasal polyposis, (see, e.g., Border and Noble, A^. 
EngL J. Med. 331:1286-1292, 1994). 
25 An effective amount of Smad6, or an agonist or antagonist thereof, is administered to 

treat the condition, which amount can be determined by one of ordinary skill in the art by routine 
experimentation. For example, to determine an effective amount of Smad6 for treating 
ossification, Smad6 can be administered and the progress of the ossification monitored using 
standard medical diagnostic methods. An amount of Smad6 which reduces the progression of 
30 the ossification, or even halts the progression of the ossification is an effective amount. The 
person of ordinary skill in the art will be familiar with such methods. Other conditions mvolving 
abnormally elevated or reduced TGF-p superfamily activity can he treated in a like manner, by 
administering Smad6 or an agonist thereof, or a Smad6 antagonist, respecri vely, to reduce or 
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elevate the TGF-p superfamily activity into normal ranges as needed. Smad6 antagonists include 
the antibodies to Smad6, dominant negative variants of Smad6 and antisense Smad6 nucleic 
acids described above. Smad6 agonists include agents which increase Smad6 expression, 
binding or activity. 

When administered, the therapeutic compositions of the present invention are 
administered in pharmaceutically acceptable preparations. Such preparations may routinely 
contain pharmaceutically acceptable concentrations of salt, buffering agents, preservatives, 
compatible carriers, supplementary immune potentiating agents such as adjuvants and cytokines 
and optionally other therapeutic agents. 

The therapeutics of the invention can be administered by any conventional route, 
including injection or by gradual infusion over time. The administration may, for example, be 
oral, intravenous, intraperitoneal, intramuscular, intracavity, subcutaneous, or transdermal. 
When antibodies are used therapeutically, a preferred route of administration is by pulmonary 
aerosol. Techniques for preparing aerosol delivery systems containing antibodies are well 
known to those of skill in the art. Generally, such systems should utilize components which will 
not significantly impair the biological properties of the antibodies, such as the paratope binding 
capacity (see, for example, Sciarra and Cutie, "Aerosols," in Remington\s Pharmaceutir;^] 
Sciences, 18th edition, 1990, pp 1694-1712; incorporated by reference). Those of skill in the art 
can readily determine the various parameters and conditions for producing antibody aerosols 
without resort to undue experimentation. When using antisense preparations of the mvcntion, 
slow intravenous administration is preferred. 

Preparations for parenteral administration include sterile aqueous or non-aqueous 
solutions, suspensions, and emulsions. Examples of non-aqueous solvents are propylene glycol, 
polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters such as ethyl 
oleate. Aqueous carriers include water, alcoholic/aqueous solutions, emulsions or suspensions, 
including saline and buffered media. Parenteral vehicles include sodium chloride solution, 
Ringer's dextrose, dextrose and sodium chloride, lactated Ringer's or fixed oils. Intravenous 
vehicles include fluid and nutrient replenishers, electrolyte replenishers (such as those based on 
Ringer's dextrose), and the like. Preservatives and other additives may also be present such as, 
for example, antimicrobials, anti-oxidants, chelating agents, and inert gases and the like. 

The preparations of the invention are administered in eftective amounts. An effective 
amount is that amount of a pharmaceutical preparation that alone, or together with further doses, 
produces the desired response. For example, in the case of treating cancer, the desired response 
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is inhibiting the progression of the cancer. In the case of treating ossification of the hgamentum 
flavum, the desired response is inhibiting the progression of the ossificalion. This may involve 
only slowing the progression of the disease temporarily, although more preferably, it involves 
halting the progression of the disease permanently. This can be monitored by routine methods or 
can be monitored according to diagnostic methods of the invention discussed herein. 

The invention also contemplates gene therapy. The procedure for performing ex vivo 
gene therapy is outlined in U.S. Patent 5,399,346 and in exhibits submitted in the file history of 
that patent, all of which are publicly available documents. In general, it involves introduction in 
vitro of a functional copy of a gene into a cell(s) of a subject which contains a defective copy of 
the gene, and returning the genetically engineered cell(s) to the subject. The functional copy of 
the gene is imder operable control of regulatory elements which permit expression of the gene in 
the genetically engineered cell(s). Numerous transfection and transduction techniques as well as 
appropriate expression vectors are well known to those of ordinary skill in the art, some of which 
are described in PCT application WO95/00654. In vivo gene therapy using vectors such as 
adenovirus, retroviruses, herpes virus, and targeted liposomes also is contemplated according to 
the invention. 

The invention further provides efficient methods of identifying pharmacological agents or 
lead compounds for agents active at the level of a Smad6 or Smad6 fragment modulatable 
cellular function. In particular, such fiinctions include TGF-P, activin and BMP signal 
transduction and formation of a TGF-P superfamily receptor-Smad6 protein complex. Generally, 
the screening methods involve assaying for compounds which interfere with a Smad6 activity 
such as TGF-P receptor-Smad6 binding, etc. Such methods are adaptable to automated, high 
throughput screening of compounds. The target therapeutic indications for pharmacological 
agents detected by the screening methods are limited only in that the target cellular function be 
subject to modulation by alteration of the formation of a complex comprising a Sniad6 
polypeptide or fragment thereof and one or more natural Smad6 intracellular binding targets, 
such as TGF-P receptor. Target indications include cellular processes modulated by TGF-p, 
activin and BMP signal transduction following receptor-ligand binding. 

A wide variety of assays for pharmacological agents are provided, including, labeled in 
vitro protein-protein binding assays, electrophoretic mobility shift assays, immunoassays, cell- 
based assays such as two- or three-hybrid screens, expression assays, etc. I'or example, three- 
hybrid screens are used to rapidly examine the effect of transfccted nucleic acids on the 
intracellular binding of Smad6 or Smad6 fragments to specific intracellular targets. The 
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Iransfected nucleic acids can encode, for example, combinatorial peptide libraries or antisense 
molecules. Convenient reagents for such assays, e.g., GAL4 fusion proteins, are known in the 
art. An exemplary cell-based assay involves transfccting a cell with a nucleic acid encoding a 
Smad6 polypeptide fused to a GAL4 DNA binding domain and a nucleic acid encoding a TGF-P 
receptor domain which interacts with Smad6 fused lo a transcription activation domain such as 
VP 16. The cell also contains a reporter gene opcrably linked to a gene expression regulatory 
region, such as one or more GAL4 binding sites. Activation of reporter gene transcription occurs 
when the Smad6 and TGF-p receptor fusion polypeptides bind such that the GAL4 DNA binding 
domain and the VP 16 transcriptional activation domain are brought into proximity to enable 
transcription of the reporter gene. Agents which modulate a Smad6 polypeptide mediated cell 
function are then detected through a change in the expression of reporter gene. Methods for 
determining changes in the expression of a reporter gene arc known in the art. 

Smad6 fragments used in the methods, when not produced by a transfected nucleic acid 
are added to an assay mixture as an isolated polypeptide. Smad6 polypeptides preferably are 
produced recombinantly, although such polypeptides may be isolated from biological extracts. 
Recombinantly produced Smad6 polypeptides include chimeric proteins comprising a fusion of a 
Smad6 protein with another polypeptide, e.g., a polypeptide capable of providing or enliancing 
protein-protein binding, sequence specific nucleic acid binding (such as GAL4), enhancing 
stability of the Smad6 polypeptide under assay conditions, or providing a detectable moiety, such 
as green fluorescent protein or Flag epitope as provided in the examples below. 

The assay mixture is comprised of a natural intracellular Smad6 binding target such as a 
TGF-P receptor or fragment thereof capable of interacting with Smad6. While natural Smad6 
binding targets may be used, it is frequently preferred to use portions (e.g., peptides or nucleic 
acid fragments) or analogs (i.e., agents which mimic the Smad6 binding properties of the natural 
binding target for purposes of the assay) of the Smad6 binding target so long as the portion or 
analog provides binding affinity and avidity to the Smad6 fragment measurable in the assay. 

The assay mixture also comprises a candidate pharmacological agent. Typically, a 
plurality of assay mixtures are run in parallel with different agent concentrations to obtain a 
different response to the various concentrations. Typically, one of these concentrations serves as 
a negative control, i.e., at zero concentration of agent or at a concentration of agent below the 
limits of assay detection. Candidate agents encompass numerous chemical classes, although 
typically they are organic compounds. JVcferably, the cmididate pharmacological agents are 
small organic compounds, i.e., those having a molecular weight of more than 50 yet less than 
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about 2500, preferably less than about 1000 and, more preferably, less than about 500. 
Candidate agents comprise functional chemical groups necessary for structural interactions with 
polypeptides and/or nucleic acids, and typically include at least an ainine, carbonyl, hydroxyl or 
carboxyl group, preferably at least two of the functional chemical groups and more preferably at 
5 least three of the functional chemical groups. The candidate agents can comprise cyclic carbon 
or heterocyclic structure and/or aromatic or polyaromatic structures substituted with one or more 
of the above-identified functional groups. Candidate agents also can be biomolecules such as 
peptides, saccharides, fatty acids, sterols, isoprenoids, purines, pyrimidines, derivatives or 
structural analogs of the above, or combinations thereof and the like. Where the agent is a 

10 nucleic acid, the agent typically is a DNA or RNA molecule, although modified nucleic acids as 
defined herein are also contemplated. 

Candidate agents are obtained from a wide variety of sources including libraries of ' 
synthetic or natural compounds. For example, numerous means are available for random and 
directed synthesis of a wide variety of organic compounds and biomolecules, including 

15 expression of randomized oligonucleotides, synthetic organic combinatorial libraries, phage 
display libraries of random peptides, and the like. Alternatively, libraries of natural compounds 
in the form of bacterial, fijngal, plant and animal extracts are available or readily produced. 
Additionally, natural and synthetically produced libraries and compounds can be readily be 
modified through conventional chemical, physical, and biochemical means. Further, known 

20 pharmacological agents may he subjected to directed or random chemical modifications such as 
acylation, alkylation, esterification, amidification, etc. to produce structural analogs of the 
agents. 

A variety of other reagents also can be mcluded in the mixture. These include reagents 
such as salts, buffers, neutral proteins (e.g., albumin), detergents, etc. which may be used to 

25 facilitate optima] protein-protein and/or protein-nucleic acid binding. Such a reagent may also 
reduce non-specific or background interactions of the reaction components. Other reagents that 
improve the efficiency of the assay such as protease, inhibitors, nuclease inhibitors, antimicrobial 
agents, and the like may also be used. 

The mixture of the foregoing assay materials is incubated under conditions whereby, but 

30 for the presence of the candidate pharmacological agent, the Smad6 polypeptide specifically 
binds the cellular binding target, a portion thereof or analog thereof. The order of addition of 
components, incubation temperature, time of incubation, and other perimeters of the assay may 
be readily determined. Such experimentation merely involves optimi::ation of the assay 
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parameters, not Ihe fundamental composition of the assay. Incubation temperatures typically are 
between 4°C and 40°C, Incubation times preferably are minimized to facilitate rapid, high 
throughput screening, and typically are between 0.1 and 10 hours. 

After incubation, the presence or absence of specific binding between the Smad6 
polypeptide and one or more binding targets is detected by any convenient method available to 
the user. For cell free binding type assays, a separation step is often used to separate bound from 
unbound components. The separation step may be accomplished in a variety of ways. 
Conveniently, at least one of the components is immobilized on a solid substrate, from which the 
unbound components may be easily separated. The solid substrate can be made of a wide variety 
of materials and in a wide variety of shapes, e.g., microtiter plate, microbead, dipstick, resin 
particle, etc. The substrate preferably is chosen to maximum signal to noise ratios, primarily to 
minimize background binding, as well as for ease of separation and cost. 

Separation may be effected for example, by removing a bead or dipstick from a reservoir, 
emptying or diluting a reservoir such as a microtiter plate well, rinsing a bead, particle, 
chromatographic column or filter with a wash solution or solvent. The separation step preferably 
includes multiple rinses or washes. For example, when the solid substrate is a microtiter plate, 
the wells may be washed several times with a washing solution, which typically includes those 
components of the incubation mixture that do not participate in specific bindings such as salts, 
buffer, detergent, non-specific protein, etc. Where the solid substrate is a magnetic bead, the 
beads may be washed one or more times with a washing solution and isolated using a magnet. 

Detection may be effected in any convenient way for cell-based assays such as two- or 
three-hybrid screens. The transcript resulting from a reporter gene transcription assay of Smad6 
polypeptide interacting with a target molecule typically encodes a directly or indirectly 
detectable product, e.g., p-galactosidasc activity, luciferase activity, and the like. For cell free 
binding assays, one of the components usually comprises, or is coupled to. a delectable label. A 
wide variety of labels can be used, such as those that provide direct detection (e.g., radioactivity, 
luminescence, optical or electron density, etc), or indirect detection (e.g., epitope tag such as the 
FLAG epitope, enzyme tag such as horseradish peroxidase, etc.). The label may be bound to a 
Smad6 binding partner, or incorporated into the structure of the binding partner. 

A variety of methods may be used to detect the label, depending on the nature of the label 
and other assay components. For example, the label may be detected while bound to the solid 
substrate or subsequent to separation from the solid substrate. Labels may he directly detected 
through optical or electron densily radioactive emissions, nonradiative energy transfers, etc. or 
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indirectly detected with antibody conjugates, streptavidin-biotin conjugates, etc. Methods for 
detecting the labels are well known in the art. 

The invention provides Smad6-specific binding agents, methods of identifying and 
making such agents, and their use in diagnosis, therapy and pharmaceutical development. For 
example, Smad6-specific pharmacological agents are useful in a variety of diagnostic and 
therapeutic applications, especially where disease or disease prognosis is associated with 
improper utilization of a pathway involving Smad6, e.g,, TGF-P induced phosphorylation of 
Smadl or Smad2, TGF-P superfamily receptor-Smad6 complex formation, etc. Novel Smad6- 
specific binding agents include Smad6-specific antibodies and other natural intracellular binding 
agents identified with assays such as two hybrid screens, and non-natural intracellular binding 
agents identified in screens of chemical libraries and the like. 

In general, the specificity of Smad6 binding to a binding agent is shown by binding 
equilibrium constants. Targets which are capable of selectively binding a Smad6 polypeptide 
preferably have binding equilibrium constants of at least about 10^ M ', more preferably at least 
about 10** M ', and most preferably at least about 10^ M '. The wide variety of cell based and cell 
free assays may be used to demonstrate Smad6-specific binding. Cell based assays include one, 
two and three hybrid screens, assays in which Smad6-mediated transcription is inhibited or 
increased, etc. Cell free assays include Smad6-protein binding assays, immunoassays, etc. 
Other assays useful for screening agents which bind Smad6 polypeptides include fluorescence 
resonance energy transfer (FRET), and electrophoretic mobility shift analysis (BMSA). 

Various techniques may be employed for introducing nucleic acids of the invention into 
cells, depending on whether the nucleic acids are introduced in vitro or in vivo in a host. Such 
techniques include transfection of nucleic acid-CaP04 precipitates, transfection of nucleic acids 
associated with DEAE, transfection with a retrovirus including the nucleic acid of interest, 
liposome mediated transfection, and the like. For certain uses, it is preferred to target the nucleic 
acid to particular cells. In such instances, a vehicle used for delivering a nucleic acid of the 
invention into a cell (e.g., a retrovirus, or other virus; a liposome) can have a targeting molecule 
attached thereto. For example, a molecule such as an antibody specific for a surface membrane 
protein on the target cell or a ligand for a receptor on the target cell can be bound to or 
incorporated within the nucleic acid delivery vehicle. For example, where liposomes are 
employed to deliver the nucleic acids of the invention, proteins which bind to a surface 
membrane protein associated with endocytosis may be incorporated into the liposome 
formulation for targeting and/or to facilitate uptake. Such proteins include capsid proteins or 
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fragments thereof tropic for a particular cell type, antibodies for proteins which undergo 
internalization in cycling, proteins that target intracellular localization and enhance intracellular 
half life, and the like. Polymeric delivery systems also have been used successfully to deliver 
nucleic acids into cells, as is known by those skilled in the art. Such systems even permit oral 
delivery of nucleic acids. 



Examples 

Methods 

Cloning of mouse Smad6 and Northern blot , 

A mouse lung cDNA library (Stratagene) was screened with an EST clone (clone ID 
429356) as a probe. One of the clones contained the entire coding region of the mouse Smad6 
and was sequenced using an ALFred sequencer (Pharmacia Biotech) and a Sequenase 
sequencing kit (USB). Sequence analysis was done with DNASTAR (DNASTAR, Inc.). 
Human and mouse tissue blots (Clontech) were probed with the EST clone. 

Plasmids 

Mammalian expression vectors with an amino-terminal tag (FLAG or Myc) were 
constructed by inserting oligonucleotides encoding the epitope-tag sequence into pcDNA3 
(Invitrogen). The coding region of the mouse Smad6 was amplified by PCR and subcloned 
into Myc-pcDNA3 or FLAG-pcDNA3. The integrity of the products were confirmed by 
sequencing. Smadl, Smad2, Smad3 and Smad4 expression plasmids were constructed in a 
similar manner. 

AfTuiity cross-linking and immunoprecipitation 

lodination of TGF-pl (R&D Systems), activin A (gift of Y. Eto), and OPl/BMP-7 (gift 
of T.K. Sampath) and the following immunoprecipitation were performed as described 
(Okadomeet al., J, Biol Chem. 271:21687-21690, 1996). 

Western blot and in vivo phosphorylation 

COS-7 cells were transiently transfected using DMRIR-C (Gibco/BRL). 
[^^P]orthophosphate- or p^]methionine/cysteine-labeling and immunoprecipitation were dore as 
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described by Nakao et al. {J, Biol Chem, 272:2896-2900, 1997). For Western blot of the 
immunoprecipitated proteins, tagged proteins were detected by chemiluminescence (ECL, 
Amersham). 

Luciferase assays 

Mink R mutant cells were transiently transfected with an appropriate combination of a 
reporter, expression plasmids, and pcDNA3 using Tfx-50 (Promega). Total amounts of 
transfected DNA were the same in each experiment, and values were normalized using sea 
pansy luciferase activity under the control of the thymidine kinase promoter (pRL-TK, Toyo 
Ink). 

Cell cultures 

C1C12 cells, F9 cells, and ST2 cells were obtained from Riken Cell Bank (Tsukuba, 
Japan). lOTl/2 cells were from American Type Culture Collection (Bethesda, MD, USA). 
C1C12 cells and lOTl/2 cells were cultured in Dulbecco's modified Eagle's medium (DMEM) 
with 10% PBS, 100 units of penicillin and 50 fig of streptomycin per ml. F9 cells were 
cultured in DMEM with 15% PBS and the antibiotics, and ST2 cells were culmred in 
RPMI1640 with 10% FBS and the antibiotics. The cells were kept in 5% COj humid 
atmosphere at 37 °C. 

Poly(A)^ RNA isolation and Northern blotting 

Poly(A^) RNA was obtained using Oligotex dT-30 Super latex beads (Takara Shuzo 
Co., Ltd.) according to the manufacturer's method. PolyCA)"" RNA (3 /ig) from cells treated 
with BMP-2 (300 ng/ml), BMP-7/0P- 1(300 ng/ml), or TGF-pi (25 ng/ml) for various time 
periods were electrophoresed in 1% gel in the presence of 2.2 M formaldehyde gels and 
blotted to Hybond N membranes (Amersham). The complete coding region of mouse Smad6 
cDNA was labeled by [a-''^P]dCTP using Random Primer Labeling Kit (Takara Shuzo Co., 
Ltd-). Hybridization was performed in a solution containing 5 x SSC, 1% SDS, 5 x 
Denbandt's solution and 10 /ig/ml sahnon sperm DNA at 65X with 2 x SSC, 1 % SDS for 20 
min. twice, 0.5 x SSC, 1 % SDS for 30 min. 0,2 x SSC, 1 % SDS for 10 min. The filters were 
stripped by boiled distilled water containing 0.1% SDS and rehybridized. 

Example 1: Identification of Sniad6 and determination of its expression 
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In search of new members of the Smad family, several expressed-sequence tag (EST) 
sequences that are not ascribed to the five mammalian Smads characterized previously 
(Massague et al., 1997) were identified. We screened a mouse lung cUNA library using one 
of the EST clones (clone ID 429356) as a probe and identified overlapping clones encoding tlie 
same protem of 495 amino acids with a predicted molecular weight of 53.7 kDa (Fig. la, SEQ 
ID N0:2; GenBank accession number AF010133). Identical residues are boxed. The 
C-terminal amino acid sequence of the protein was almost identical with the entire protein 
sequence of a GenBank clone deposited as the human Smad6 (235 amino acids, accession 
number: U59914). It thus was concluded that our clone is the full-length mouse Smad6. 
Chromosomal localization of die human Smad6 (JVI5-1) has been reported (Riggins et al., 
Nature Genet, 13:347-349, 1996). Except for the human Smad6, all of the Smads, including , 
Drosophila and C. elegans Smads, comprise conserved N-terminal and C-terminal regions 
(MHl and MH2, respectively) separated by a proline-rich linker region of variable length and 
sequence, although Smad4 has a unique insert in its MH2 region (Massague, et al., 1997). The 
C-terminal one-third of Smad6 shares the conserved sequence with the MH2 regions of the 
other Smads, whereas its N-terminal region shows a striking difference from the conserved 
MHl sequence (Fig. la), suggesting a novel function of this molecule. 

Northern blot analysis of various human and mouse tissues, using the human EST 
clone 429356 as a probe, revealed relatively ubiquitous expression of the mRNA species of 
3.0 kh with the highest expression in lung (Fig. lb; human (left) and mouse (right)). 

Example 2: Receptor binding of Smad6 

Members of the TGF-p superfamily exert their diverse effects through binding to two 
types of receptors with serine-threonine kinase activity (Yingling et al., Riochim, Biophys. 
Acta 1242:115-136, 1995). The ligand first binds to the type 11 receptor, which consequently 
activates the type I receptor by direct phosphorylation. The activated type I receptor then 
phosphorylates ligand-specific Smads such as Smadl, Smad2, and SmadS (Massague, et al., 
1997; Zhang et al., Nature 383:168-172, 1996; Lagna et al., Nature 383:832-836, 1996; 
Macias-Silva et al., Cell 87:1215-1224, 1996). The association of Smad2 with T^R-I requires 
activation of TpR-I requires activation of TpR-1 by the type 11 receptor (TpR-II) (Maci'as-Silva 
et al-, 1996), Smad2, however, interacts with TpR-I only transiently under physiological 
conditions, since Smad2 is released from TPR-I after phosphorylation by the receptor. The 
interaction of Smad2 with the TpR-I has thus been observed only when the kinase-defective 



wo 98/56913 PCT/US98/12078 

-36" 

form of TpR-I is used (Macias-Silva et aL, 1996). It should be noted that Smad4 does not 
associate the receptors (Zhang et al., 1996). 

The interaction of Smad6 with the type I receptors was examined in affinity cross- 
linking assays. COS-7 cells were transfected with FLAG-tagged Smad6 (F-Smad6) or FLAG- 
tagged Smad2 (F-Smad2) in combination with the wild type (wt) or kinase -defective (KR) HA- 
tagged TpR-I and hexahistidine-tagged TpR-II. Cells were affinity labeled with ^'^I-TGF-pl 
and lysates were inmiunoprecipitated with anti-IIA antibody or anti-FLAG M2 antibody. 
Immune complexes were subjected to SDS-PAGE and autoradiography. Smad6 bound to both 
wild type and kinase-defective TpR-I depending on the kinase activity of TpR-II. In contrast, 
Smad2 bound to kinase-defective ipR-I but not to the wild type TpR4. Smad6 bound to the 
TGF'P receptor complexes as revealed by the coprccipitation of the receptor complexes with 
Smad6 (Fig. 2a). Similarly to Smad2, the binding of Smad6 to XpR-I required the kinase 
activity of TpR-II (Fig. 2a). Smad6, however, stably bound to the wild type TpR-I. 

Similar results were obtained with the activin type IB receptor (ActR-IB) using *^^I- 
activin A (Fig. 2B) and the BMP type IB receptor (BMPR-IB) using ^^^i-OP-I/BMP-V (Fig. 
2C) in which Smad6 bound to both wild type and kinase-defective type I receptors. These 
results suggest that Smad6 binds to the type I receptors in a ligand-dependent manner but 
exerts a role different from that of the other Smads. 

Example 3: Effect of Sniad6 on phosphorylation of Smads 

Smad2 is phosphorylated at its carboxy-terminal end by activated TpR-I (Macias-Silva 
et al., 1996). The phosphorylation is essential to the following downstream signaling events 
that culminate in transcriptional activation of the target genes, since disruption of the 
phosphorylation sites abrogated TGF-P-induced responses (Macias-Silva et al., 1996). Thus 
the effect of Smad6 on the phosphorylation of Smad2 was examined (Fig. 3). COS-7 cells 
were transiently transfected with constitutively active (TD) TPR-I, FLAG-Smad2 (F-Smad2), 
and/or Myc-Smad6 (M-Smad6). Cells were labeled with p^PJorthophosphate and lysates were 
subjected to immunoprecipitation with anti-Myc antibody, Phosphorylated Smad6 was 
detected by SDS-PAGE and autoradiography. Doublet bands of phosphorylated Smad6 were 
detected. Cell lysates also were immunoprecipitated with anti-FLAG antibody to detect Smad2 
phosphorylation. Expression levels of Smad6 (panel A), Smad2 and TpR-I (TD) (panel B) 
were monitored by labeling of the cells with p^'Sjmethionine/cysteine. 

Neither TpR-I (TD) or FLAG-Smad2 affected the phosphorylation of Smad6. 
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Phosphorylation of Sinad2 induced by constitutively active XPR-I was suppressed by Smad6 
(39% reduction as normalized for the ^^S-labeled band), whereas Sniad2 did not affect 
constitutive phosphorylation of Smad6 (Fig. 3 A, B), 

Smad3 and Smad2 share 91 % identity in their amino acid sequences and are 
independently shown to mediate TGF-p signals (Eppert et aL Cell 86:543-552, 1996; Zhang et 
al., 1996), although functional differences of the two molecules are siill unknown. A similar 
experiment as above was done with Sraad3. Smad6 rather enhanced receptor-induced 
phosphorylation of Smad3 (Fig. 3C), suggesting differential effects of Smad6 to these closely 
related molecules. 

Next, the effect of Smad6 on SmadI phosphorylation was studied (Fig. 3D), Smadl 
was phosphorylated by the constitutively active BMP type lA receptor (BMPR-IA) as well as 
BMPR-IB. Smad6 efficiently inhibited phosphorylation induced by the latter (60% reduction) 
but not by the former. These results suggest that Smad6 acts as an inliibitor to certain 
members of the Smad family. 

Example 4: Effect of Smad6 on Smad complex formation 

Smad2 heteromerizes with Smad4 upon phosphorylation by TpR-I (Lagna et al., 1996). 
It was recently shown that TGF-p also induces association of Smad2 and Smad3 (Nakao et al., 
EMBO. y. 16:5353-5362, 1997). The effect of Smad6 on the heteromerization of these Smads 
was examined (Fig. 4). COS-7 cells were transfcctcd with the indicated combination of 
plasmids and subjected to immunoprecipitation followed by Western blot detection. 
Expression levels of Smad2 (bottom), Smad4 (middle), and Smad6 (middle) were monitored. 
Note that Smad6 did not interact with Smad2 under these conditions (top). 

Smad2 formed a complex with Smad4 in the presence of constitutively active TpR-I as 
shown by coprecipitation of Smad4 with Smad2. The complex formation was abrogated by 
Smad6 (Fig. 4A, lop panel). TpR-I-induced interaction of Smad3 and Smad4, however, was 
not affected by Smad6 (Fig, 4B), as Smad4 coprecipitated with Smad3 both in the presence 
and absence of Smad6 (top). This is consistent with the result that Smad6 does not inhibit 
Smad3 phosphorylation (Fig. 3C). Furthermore, heteromerization of Smad2 and Smad3 was 
inhibited by Smad6 (Fig. 4C), suggesting that phosphorylation of both proteins is necessary 
for this interaction. These results suggest that Smad6 specifically interferes with the activation 
of Smad2 in TGF-p signaling. 
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Example 5: Effect of Smad6 on TGF-P signaling 

The role of Smad6 in TGF-P signaling was tested, as assessed by luciferase reporter 
gene assays. P3TP-Lux, a sensitive reporter for TGF-P, was used in R mutant mink cells 
deficient in TPR-I (Fig. 5). Wild type TpR-I restored TGF-p response in these cells. Mink R 
mutant cells deficient in TpR-I were transfected with p3TP-Lux reporter, TpR-I, and 
increasing amounts (/xg) of Smad6 DNA. Cells were treated with (closed bars) or without 
(open bars) 5 ng/ml TGF-P 1 for 24 h. Smad6 suppressed the activation of the reporter gene in 
a dose-dependent manner (Fig. 5a). Transcriptional activation by constitutively active TpR-1 
was suppressed as well (Fig. 5b). 

Cyclin A expression is necessary for cell cycle progression and is suppressed by TGF- 
P (Feng et al., /. BioL Chem. 270:4237-24245, 1995). A cyclin A luciferase reporter, 
pCAL2, was used to examine the effect of Smad6 on TGF-P signaling. TpR-I (consitutively. 
active) downregulated cyclin A luciferase activity, but increasing amounts of Smad6 
counteracted the effects of TpR-I in the cyclin A luciferase assay (Fig. 5c). These results 
indicate that Smad6 interfered with TGF-P signals in two distinct responses. 

Example 6: Regulation of Smad6 expression by TGF-p I and BMPs 

To determine the mechanisms by which the expression of Smad6 is controlled, the effect 
of TGF-P and other family members on Smad6 mRNA expression was examined. To determine 
the effect of BMP-2, BMP-2 was added at 300 ng/ml to the culture medium of several BMP-2- 
responsive cll lines: F9, C3H10TV2, 812 and C2C12. After 6 hours, poly A' mRNA was isolated 
from the cells as described above. Samples of mRNA were electrophoresed, blotted and probed 
with "P-labeled Smad6 coding region. The results of the Northern blot are shown in Fig. 6A. 
Expression of Smad6 was induced in all of the BMP-2 responsive cell lines after BMP treatment. 

To determine the effect of OP-l/BMP-7, the expression of Smad6 mRNA in C2C12 
myoblasts after stimulation with 300 ng/ml OP-I/BMP-7 was tested. Poly A' mRNA was 
isolated at the indicated times following OP-l/BMP-7 stimulation. As shown in Fig, 6B, Smad6 
expression was induced by OP-l/BMP-7 at 6 hours after stimulation, and remained induced up to 
at least 48 hours after stimulation, the duration of the experiment. 

To determine the effect of TGF-P K the expression of Smad6 niRNA in C2C12 myoblasts 
after stimulation with 25 ng/ml TGF-P 1 was tested. Poly A^ mRNA was isolated at the indicated 
times following TGF-p 1 stimulation. As shown in Fig. 6C. Smad6 expression was induced by 
TGF-p 1 at I hour after stimulation, but decreased thereafter. After 12 hours, the expression of 
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Smad6 mRNA was below tlie basal level of expression. 

In summary, the BMPs tested (BMP-2 and BMP-7/0P-1) induced the expression of 
Smad6 mRNA. In contrast, TGF-p 1 initially induced the expression of Smad6, but Smad6 levels 
subsequently decreased to below basal levels. Thus different members of the TGF supcrfamily 
5 exert opposing effects on the expression of Smad6, 

HOUIVALENTS 

Those skilled in the art will recognize, or be able to ascertain using no more than routine 
experimentation, many equivalents to the specific embodiments of the invention described 
10 herein. Such equivalents are intended to be encompassed by the following claims. 

All references disclosed herein are incorporated by reference in their entirety. 
A Sequence Listing is presented followed by what is claimed: 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: 

(A) NAME: LUDWIG INSTITUTE FOR CANCER RESEARQI 

(D) STREET: 1345 AVENUE OF THE AMERICAS 

(C) CITY: NEW YORK 

(D) STATE: NE^ YORK 

(E) COUNTRY: UNITED STATES OF AMERICA 

(F) POSTAL CODE: 10105 

(i) APPLICANT: 

(A) NAME: JAPANESE FOUNDATION FOR CANCER RESFARCH 

(B) STREET: 1-37-1 KAMI -IKEBUKURO, 1-CHCME, TOSHIMA-KU 

(C) CITY: TOKYO 170 
(E) COUNTRY: JAPAN 

(ii) TITLE OF THE INVENTION: SMAD6 AND USES THEREOF 

(iii) NUMBER OF SEQUENCES: 10 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Wolf, Greenfield & Sacks, P.C. 

(B) STREET: 600 Atlantic Avenue 

(C) CITY: Boston 

(D) STATE: MA 

(E) COUNTRY: U.S.A. 

(F) ZIP: 02210-2211 

(v) COMPUTER READABLE FORM: 
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(A) MEDIUM TYPE: Diskette 

(B) COMPUIER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSBQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/049,990 

(B) FILING DATE: 13-JUN-1997 

(A) APPLICATION NUMBER: 60/053,040 

(B) FILING DATE: 18 -JUL- 1997 

(A) APPLICATION NUMBER: 60/066,173 

(B) FILING DATE: 18 -NOV- 1997 

(viii) ATTORNEY/AGENT INTORMATION: 

(A) NAME: Van Amsterdam, John R. 

(B) REGISTRATION NUMBER: 40,212 

(C) REFERENCE/DOCKET NUMBER: L0461/7038WO 

(ix) TELEOO^MJNICATION INFORMATION: 

(A) lELEPHONE: 617-720-3500 

(B) TELEFAX: 617-720-2441 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID N0:1: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) NDLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 751... 2235 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

CCCCTCGAGG TCGACGGTAT CGATAAGCTT GATATCGAAT TCCl'i'llTiT TrATGGCTTC 60 

CACTCATCTG TTGACACCCG CGITCAGGAG AGACTTGCCC CAAGTGCACC GAGCGCCOGG 120 

GACCTGAGAC GGAATTGCTT TTCGTGCGTG CAAAATCCAA GCATTTrGAG TTTrGTTTGG 180 

GACCmTTC ITGCnTGCI' TTTATTTCTA I'iTITATm GTTGCAGGGA TATGGGAGIT 240 

ATCCACAAGC CITAGTTTOG GATCCTGGAG GGAAAGCCCA TGTAGCATAG CITGGCnTT 300 

GAAGGCAGAG TTGTGCAGAC ACATITQGGG GCAOGACGCA AGCGCITTGT GCTCGTGTAC 360 

CAGCCGCGCA ACnTTGAAG GCIOGCGGGC CCATGCAGGG TGTCTCTAGC ATCGTTTOGC 420 

TGGTGGCITC CCTAAGGCTC CAAAGCAGCT GGAGTTGAGC GGTCCCGGCC CATCGTGATC 480 

CATGTAGCCC GCTGGTCCCT CGCGGACTGA GGCTCAACAC GCGCGTGTTC CCGGCCCGGC 540 

CCGGCCCGGC TTGGCCCGGC GCGAGCTCCC TCATGITGCA GCCCTGCGGr GCCCCTTCGA 600 

CGACAGGCIG TTGCGCGGTC TGCAOGGCGC CCOGCGGCAG AGCITCATCT GGGGCTGCGG 660 

CCGCTCAGCC GGCGCCTCGT TGAGGSAACG GACCCCCGGT AACCGGAGAC CGCCTCCCCT 720 

CCCACCAHHH MAGGCGCCAA AGGGTATCGT ATG TTC AGG TCI AAA CGT TCG GGG 774 

Met Phe Arg Ser Lys Arg Ser Gly 
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Val Arg Leu Tip Arg ser *rg Val Val Pro aln Glu 



822 



15 



20 



Gly Ser Gly Gly Gly Gly Gly Val ^ Glu ssp Gly Ser Leu Gly &r 



870 



35 



40 



10 



15 



aaGrrG«GCCrGCCCa5 0GGGC.C«GKGGCGGAGGC™c«CCX.C 
Arg Ma Glu Pro Ma Pro Arg Ma Arg Glu Gly Gly Gly Cys Ser Arg 

«X GM GTC OGC TO. GTA GCC C03 ca. a=G ore CGG GAC GCG GIX. GGA 
S=r Glu val Arg Ser Val Ma Pr. Arg Ar^ p„ „^ 

65 

C030GAGGCGCCGO3ATCGO3GGCAa5CGC0=GCGCACAGGGOGCCK: 
Pro Arg Gly Ma Ma lie Ma Gly Arg Arg Arg Arg Thr Gly Gly Leu 



75 



80 



20 



85 



COG AGG Crc GTC TCG GAG C^C GGG GCr GGG GGC ICC CCG CIG 

Pxo Arg Pro Val Ser Glu Ser Gly Ala Gly Ala Gly Gly Ser Pro Leu 
^0 95 

100 

^5 «T ^3 «^ G« Orr GGA «3C CCA or OG CCr GAG AC^ GAC ^ 
val Ala Glu Pr. Gly Gly P„ Gly Leu Pro Glu Ser Asp Cy. 



105 



110 



115 



120 



30 



GAG ACG Gm ACC ^ CTC TTC TX:C GAA CGG GAC GCA GCA GGC GOG 

Glu Thr val Thr Cys Cys Leu Phe Ser Glu Arg Asp Ala Ala Gly Ala 



918 



966 



1014 



1062 



LllO 



1158 



135 
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CCC QGG GAC TCT GGC GAT CCC CAA GCC AGA CAG TCC CCG GAG CCG GAG 
Pro Arg Asp Ser Gly Asp Pro Gin Ala Azg Gin Ser Pro Glu Pro Glu 
140 145 

GAG GGC GGC GGG OT ax; AGT a;c GAA GCC or TCG CGA CIG CTC err 
Glu Gly Gly Gly Pro Arg Ser Arg Glu Ala Arg Ser Arg Leu Leu Leu 
155 ISO 

CTG GAG CAG GAG CTC AAG AOG GTC ACXJ TAC TOG CTG CTC AAG AQG CTC 1302 
l^u Glu Gin Glu Leu Lys Thr Val Thr l-yr Ser Leu Leu Lys Arg Leu 



170 175 



180 



AAG GAG OGT TCG CTG GAC ACG ere TTG GAG GCT GIG GAG TCC CGA GGC 
Lys Glu Arg Ser Leu Asp Thr I^u Leu Glu Ala Val Glu Ser Arg Gly 

195 200 

GGC CTA COG GGC GGC TGC GT3 CIG GTG CCG CGC GCC GAC CIC CGC TIG 
Gly Val Pro Gly Gly Cys Val Leu Val Pro Arg Ala Asp Leu Arg Leu 

205 210 215 

GGC GGC CAG CCC GOG CCA CTG CAG CIG CIG CTC GGC CGC CTC TTC CGC 
Gly Gly Gin Pro Ala Pro Pro Gin Leu Leu Leu Gly Arg Leu Phe Arg 

225 230 

TGG CCA GAC CTG CAG CAC GCA GTG GAG CTG AAA CCC .CTTC TGC GGC TGC 
Trp Pro Asp Leu Gin His Ala Val Glu Leu Lys Pro Leu Cys Gly Cys 
235 240 245 

CAC AGC rrr ACC GCC GCC GCC QAC GGG CCC ACG GTG TGT TGC AAC CCC 
His ser Phe Thr Ala Ala Ala Asp Gly Pro Thr Val Cys Cys Asn P^ 

255 260 



1254 



1350 



1398 



1446 



1494 



1542 
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m:caciTCi«:o3GcicT=CGGGcm««TO.crGccGca:crcTAT 

lyr Hi. Phe ser Arg Leu Cyo Gly Pro Glu Scr Pro Pro Pro Pro Tyr 



1590 



270 ,.. 

280 



21i 



ra- 035 crc irr CCT ccr TAc MG CCA cro cs,T ere Tcc GAT 

Ser Arg Leu Ser ftro Pro Aap Gin lyr Lys Pr. Leu Asp Leu Ser top 



1638 



285 



290 



295 



10 



15 



TUr ACA TK; TUT TAG ACT GAA ACC GAG GCC AGO AAG TGC CTC ATC ACT 
Ser Thr Leu Ser Tyr Thr Glu Thr Glu Ala Thr Asn Ser Leu lie Thr 

GCr CGG GGT GAA TTC TC^ GAT GCG AGO ATG TCT OI. GAT GCC ACC AAG 
Ala Pro Gly Glu Phe Ser Asp Ala Ser Met: Ser Pro Asp Ala Lys 



315 320 



325 



OOGAGCCBCIGGrarAGCSIGO^KCTOGAGCACCGGACACGCGre 
Pro ser His Trp Cys Ser Val Ala Tyr Trp Glu His Arg Thr Arg Val 



•^-^^ 340 



20 



GGC CGC Crc TAT GCG GTO TAC GAC CAG GCT GTC AGC ATT TIC TAC GAC 
Gly Arg Leu lyr Ala Val Tyr Asp Gin Ala Val Ser lie Phe lyr Asp 



350 



355 



360 



« ''A CCT CAG ox AGC GGC TTC TGC CI^ GGC C.G cnr AAC CK= GAG CAG 
Leu Pro Gin Gly Ser Gly Phe Cys I^u Gly Gin Leu Asn Leu Glu Gin 



370 



375 



30 



OGC AGT GAG TOG GIG 03G aSC Aa= ax AGC AAG ATX: GGT m- GGC ATA 
Arg ser Glu Ser Val Arg Arg Thr Arg Ser Lys He Gly Phe Gly lie 



1686 



1734 



.1782 



1830 



1878 



1926 



380 



385 



390 
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CTG CrC AGC AAG GAG CCA GAG GGC GTG TGG GCC TAG AAC OGG GGC GAG 1974 
Leu Leu Ser Lys Glu Pro Asp Gly Val I'rp Ala lyr Acn Arg Gly Glu 
395 400 405 

CAC GCC ATC 1TC GTG AAC TCC CCG ACG CTG GAT GOG CCC GGA GGC CGC 2022 
His Pro He Phe Val Asn Ser Pro Thr Leu Asp Ala Pro Gly Gly Arg 
410 415 420 

GCC CTG GTC GTG CGC AAG GTG CCA CCG GGT TAG TCC ATC AAG GTG TTC 2070 
Ala Leu Val Val Arg Lys Val Pro Pro Gly Tyr Ser He Lys Val Phe 
425 430 435 440 

GAC TTT GAG CGC TCA GGG CTG CTG GAG CAC GCA GAC GCC GOT CAC GGC 2118 
Asp Phe Glu Arg Ser Gly Leu Leu Gin His Ala Asp Ala Ala His Gly 
445 450 455 

CCC TAG GAC CCG CAC ACT GTG GGC ATC AGC TTC GCC AAG GGC TGG GGA 2166 
Pro Tyr Asp Pro His Ser Val Arg He Ser Phe Ala Lys Gly Trp Gly 
460 465 470 

CCC TGC TAG TGG CGA CAG TTC ATC ACC TCC TGG CCC TGT TGG CTG GAG 2214 
Pro Cys Tyr Ser Arg Gin Phe He Thr Ser Cys Pro Cys Trp Leu Glu 
475 480 485 

ATC CTA CTC AAC AAC CAC AGA TAGCAATGCG GCTGCCACTG T3CCGCAGCG TCCC 2269 
He Leu Leu Asn Asn His Arg 
490 495 

CCAACCTCTG GGGGGCCAGC GCCCAGAGAC ACCACCCCAG GGACAACCTC GCCCTCCCCC 2329 

CAGATATCAT CTACCTAGAT TIAATATAAA GITTTATATA TTATATGGAA ATATATATTA 2389 

TACITGTGGA ATTATCGAGT CAlTilTACA ACGTAATTAT TTATATATGG TGCAATGTGT 2449 

GTATATGGAG AAACAAGAAA GACGCAdTT GGCTTGTAAT TCTTTCGGAA TTCCTGCAGC 2509 



10 
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CCGGGGGATC CACTAGTTCT AGAGC 

(2) INFORMATION FOR SEQ ID N0:2; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 495 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Phe Arg Ser Lys Arg Ser Gly Leu Val Arg Arg Leu Trp Arg 

Arg Val Val Pro Asp Arg Glu Glu Gly Ser Gly Gly Gly Gly Gly Val 

20 Asp Glu Asp Gly Ser I^u Gly Ser Arg Ala Glu Pro Ala Pro Arg Ala 
35 40 45 

Arg Glu Gly Gly Gly Cys Ser Arg Ser Glu Val Azg Ser Val Ala Pro 
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15 



Ser 



50 



55 



60 



Arg Arg Pro Azg Asp Ala Val Gly Pro Arg Gly Ala Ala He Ala Gly 
25 65 



70 7c: 



Arg Arg Arg Arg Thr Gly Gly Leu Pro Arg Pro Val Ser Glu Ser Gly 

85 90 95 

Ala Gly Ala Gly Gly Ser Pro Leu Asp Val Ala Glu Pro Gly Gly Pro 
100 105 110 

30 Gly Trp Leu Pro Glu Ser Asp Cys Glu Thr Val Thr Cys cys Leu Phe 
115 120 125 

Ser Glu A.-g Asp Ala Ala Gly Ala Pro Arg Asp Ser G]y Asp Pro Gin 



2534 
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Ma Arg Gin Ser Pro clu Pro olu Glu Gly aly aly Pro Arg Ser ^ 
"5 150 ^ 

Glu Ala Arg Ser Arg l^u Leu Leu I.u Glu Gin Glu Leu Lys Thr Val 

Thr IVr ser Leu Leu Lys Arg I.u Lys Glu Arg Ser Leu Asp Ttu: Leu 

I^u Glu Ala val Glu Ser Arg Gly Gly Val P.o Gly Gly Cys Val L.u 

200 

iO val Pro Arg Ala Asp Leu Arg Leu Gly Gly Gin Pro Ala Pro Pro Gin 

210 2m 

-^^^ 220 

Leu I.U Leu oly Arg Phe Arg Pro A^p Leu Gin His Ala Val 

•^25 230 

Glu Leu Lys Pro I.u Cye Gly Cys His Ser Phe Ita Ala Ala Ma Asp 

Gly Pro val Asn Pro ,Vr His Phe Ser Arg Leu Cys Gly 

260 Pf-c 

^65 270 

Pro Glu ser Pro Pro Pro P«, ^ ser Arg Leu Ser Pro Pro Asp Gin 

280 2Q5 

20 Tyr Lys Pro Leu Asp Leu Ser Asn q^^r- t^v- t 

F i,er Asp Ser Thr Leu Ser Tyr Thr Glu Thr 

295 

Glu Ala Asn Ser Leu He T^ Ala Pro Gly Glu Phe Ser Asp Ala 

315 

■^^^ 320 
ser „e. Ser Pro Asp Ala Thr Lys ^ Ser His ^ cya Ser Val Ala 

Tyr Trp Glu His Arg Thr Arg Val Gly Arg Leu lyr Ala Val T^^ Asp 

Gin Ala Val Ser He Phe TVr Asp Leu Pro Gin Gly Ser Gly Phe Cys 

J60 3g5 
30 I^u Gly Gin l.u Asn Leu Glu Gin Arg Ser Glu ser Val Arg Arg Thr 



370 



375 



380 



ser Lys He Gly Phe Gly He r.u Leu Ser Lys Glu Pro Asp Gly 
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390 3gc: 

•^^^ 400 

Val Trp Ala TVr Asn Arg Gly Glu His Pro He Phe Val Asn Ser Pro 

Thr Leu Asp Ala Pro Gly Gly Arg Ala Leu Val Val Arg Lys Val Pro 

Pro Gly Tyr Ser He Lys Val Phe Asp Phe Glu Arg Ser Gly Leu Leu 

440 

Gin His Ala Asp Ala Ala His Gly Pro lyr Asp Pro His Ser Val Arg 

'0 lie ser Phe Ala Lys Gly Gly Pro Cys Tyr Ser Arg Gin Phe He 

^'^ 480 
Thr Ser Cys Pro Cys Trp I^u Glu He I^u I.u Asn Asn His Arg 



485 



495 



15 



20 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENCHH: 1488 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cEJm 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATGTTCAGGT CIAAACGnc GGGGCTGGIX. CGAC^cm GGCGAAGPCX. IX^TU^TCCrr ,o 

GAIO^GGAGG AAGGCAGCGG CGGCGGCGOT GGI^CGACB AGGATt^G CCIGGGCAGC 120 

CGAGCK^GC CimrCCGCG GGCACGAGAG GGCT^GGCT GCAC^CC^C CGAAGTCCGC 180 

30 ^^^TAGCCC CGO^C CCGGGAOX^ GTGGGAOrGC GAGGCXXrCGC GAIXX^GGG^ .40 

^CGGC GCACCm^ CCTCCCGAC^ CCO^Ica. AGTCGGGGGC CGGGGd^ 3^00 

OGCTCCCO^ TGGATCI^C GGAGCCIGGA GGOTCAa^ GGCTGCCTGA GAOTGACir^ 3S0 



25 



wo 98/56913 PCT/US98/12078 

- 50 - 

GAGACGGTGA CCTOCTOTCr CTTCTCCGAA CGQGAC3CAG CM^GCGCGCC CCGGGACTCT 420 

GGCGATCCCC AAGCCAGACA GTCCCQGGAG CCGGAGGAGG GCGGCGGGCC TCGGAGTCGC 480 

GAAGCCOGCT CGOGACTGCT GCTTCTGGAG CAGGAGCTCA AGAOGGTCAC GTACTOGCTG 540 

CTCAAGAGGC TCAAGGAGCG TrCGCTGGAC AOGCrGTrGG AOGCrGTGGA GTCCCXW3GC 600 

GGCGTACOOG GCXSGCTGOGT GCTGGTGCCG CGOGCCGACC TCCGCITGGG CGGCCAGCCC 660 

GCGCGACCGC AGCTGCTGCr CGGCCGCCTC TTCCGCTGGC CAGACCTGCA GCACGCAGTG 720 

GAGCTGJ^C CCCTCTXXX^ CTGCCACAGC TTTACCGCCG CCGCCGACGG GCCCACGGTG 780 

TCTTGGAACC CCTACCACIT CAGCCGGCTC TGCGGGCCAG AATCACCXXT GCCCCCCTAT 840 

TCTCGGCTGT CTCCICCLGA CCAGTACAAG CCACTGGATC TGTCCGATTC TACATTGTCT 900 

l2\GACrGAAA CGGAGGCGAC CAACTTCCCTC ATGACTGCTC CGGGTGARIT CTGAGATGCC 960 

AGCATGTCTC OGGATGCGAC CAAGCCGAGC CACTGGTGCA GCGTGGCGTA CTGGGAGCAC 1020 

CGGACACGCG TGGGCO^CCr CTATGCGGTG TAOGACCAGG CTGTCAGCAT TTTCTACGAC 1080 

CTACCTCAGG GCAGCGGCTT CTGCCTGGQC CAGCTCAACC TGGAGCAG03 CAGIGAGTO 1140 

GTGCGGCGCA CGCGCAGCAA GATOGGTITT GGGATACTGC TCAGCAAGGA GCCAGACGGC 1200 

GTGTGGGCCr ACAACOGGGG CGAGGACCCC ATCITGGTCA ACTCCCCXmC GCIGGATGCG 1260 

CCOSGAGGCC GOSCCCrGGT CCTGCGCAAG GTGCCAGCGG GTTACTCCAT CAAGGTGTTC 1320 

GACITTGAGC GCTCAGGGCT GCTGCAGCAC GCAGACGCCG CTGAOGGCCC CTACGACCCG 1380 

CACAGTGTGC GCATCmZYT CGCCAAGGGC TGGGGACCCT GCTACTCGCG ACAGTTCATC 1440 

ACCTCCTGCC CCTGITGGCr GGAGATCCTA CTCAACAACC ACAGATAG 1488 

(2) IJSIFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1270 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOIiOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 4: 
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AAAAGAACXIA ATCCAGCACC AAAACGTGCT ACAACAlTGGA TGAACTTCGA CCACATGAAA 60 

GAAGAAGCCA GCCACAAAAG GCCATATATT GTAT3AAATG AAATGTCCAG AATGGGCAAA 120 

CCCATAGAGA CACAAAAATC TCCXXCACCT CCCTACTCTC GGCTGTCTCC TCX3CGACXSAG 180 

TACAAGCCAC TGGATCTCTC CGATTCTACA TTGTCITAGA CTGAAAOGGA GGCTACCAAC 240 

TCCCrCATCA CrGCTCCX3GG TGAATTCTCA GACGCCAGCA TCTCTCOGGA CX^CCACCAAG 300 

CCJGAGCCACrr GGTGCAGOGT GGCGTACTGG GAGGACCGGA CGCGCGTGGG CCGCCTCTAr 360 

GCGGTGTACG ACCAGGCOGT CAGCATCTTC TACGACCTAC CTCAGGGCAG CGGCTTCTGC 420 

CTGGGCCAGC TCAACCTOGA GCAGCGCAGC GAGTCGGTGC GGOGAACGGG CAGCAAGATC 480 

GGCrrCGGCA TCCTGCrCAG CAAGGAGCCC GACGGCGTGT GGGCCTACAA CCGCGGCGAG 540 

CACCCGATCr TOGTGAACTC CCDSACSGCTG GAOGCGCCGG GCGGCOGGGC CCT3GT0GTC 600 

CGGAAGGTGC CCCCCGGCTA CTCCATCAAG GTGTrCGACT TCGAGCGCTC GGGCCTGCkG 660 

CACGGGCCCG AGCCOGACGC CGCCGAa3GC CCCTACGACC CCAACAGCGT CCGCATCAGC '720 

TTOGCCAAGG GCTGGGGGCC CTGCTACTCC CGGCAGTTCA TCACCTCCTG CCCCTGCTGG 780 

CTGGAGATCC TCCTCAACAA CCCCAGATAG TGGOGGCCCC GGCGGGAGGG GCX3GGrGGGA 840 

GGCCGCGGCC ACCGCCACCT GCCX3GCCrC33 AGAGGGGCCG ATGCCCAGAG ACACAGCCCC 900 

CAOGGACAAA ACCCX:CCAGA TATCATCTAC CTAGATTTAA TATAAAGTIT TATATATTAT 960 

ATGGAAATAT ATATTATACT TGTAATTATG GAGTCATTTT TAGAATGTAA TTATITATGT 1020 

ATQGTGCAAT GTGTGTATAT GGACAAAAGA AGAAAGACGC ACTnTGGCIT ATAATTCm 1080 

CAATACAGAT ATATTTTCIT TCTCTTCCTC CTTCCTCTTC CTTACTTnT ATATATATAT 1140 

ATAAAGAAAA TGATACAGCA GAGCTAGGTG GAAAAGCCTG GGTTTGGTGT ATGGTTTTTG 1200 

AGATATTAAT GCCCAGACAA AAAGCTAATA CCAGTCAGTC GATAATAAAG TATTCGGATT 1260 

ATAAAAAAGA 1270 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 235 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLCGY: linear 



(ii) MOLECOIiE TYPE: protein 
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(Xi) SEQUENCE DESCRIPTION: SBQ ID NO: 5: 
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10 



20 



Met Ser Arg Met Gly Lys Pro He Glu Thr Gin Lys Ser Pro Pro Pro 

5 Pro TVr ser Arg Leu Ser Pro Arg Asp Glu Tyr Lys Pro Leu Asp Leu 
2° 25 30 

Ser Asp Ser Thr I^u Ser Tyr Thr Glu Thr Glu Ala Thi- Asn Ser Leu 

35 40 45 

lie Thr Ala Pro Gly Glu Phe Ser Asp Ala Ser Met Ser Pro Asp Ala 

55 60 
Thr Lys Pro Ser His Trp Cys Ser Val Ala Tyr Trp Glu His Arg Thx 

Arg Val Gly Arg I^u Tyr Ala Val Tyr Asp Gin Ala Val Ser He Phe 
85 90 95 

•5 Tyr Asp I^u Pro Gin Gly Ser Gly Phe Cys Leu Gly Gin Leu Asn Leu 
100 105 
Glu Gin Arg Ser Glu Ser Val Arg Arg Thr Arg Ser Lys He Gly Phe 

115 120 125 

Gly He Leu I^u Ser Lys Glu Pro Asp Gly Val Ala Tyr Asn Arg 

135 140 
Gly Glu His Pro He Phe Val Asn Ser Pro Thr Leu Asp Ala Pro Gly 

145 150 ,c;c 

155 160 

Gly Arg Ala Leu Val Val Arg Lys Val Pro Pro Gly Tyr Ser He Lys 
165 170 175 

25 Val Phe Asp Phe Glu Arg Ser Gly .Leu Gin His Ala Pro Glu Pro Asp 
180 185 190 

Ala Ala Asp Gly Pro Tyr Asp Pro Asn Ser Val Arg He Ser Phe Ala 

195 200 205 

Lys Gly Trp Gly Pro Cys lyr Ser Arg Gin Phe He Thr Ser Cys Pro 



30 



210 



215 



220 



Cys Trp l^u Glu He Leu Leu Asn Asn Pro Arg 
225 230 



235 
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(2) INPORmTTON FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 465 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Asn Val Thr Ser Leu Phe Ser Phe Thr Ser Pro TVla Val Lys Arg 

15 10 15 

Leu Leu Gly Trp Lys Gin Gly Asp Glu Glu Glu Jiys Trp Ala Glu Lys 

20 25 30 

Ala Val Asp Ala Leu Val Lys Lys Leu Lys Lys Lys Lys Gly Ala Met 

35 40 45 

Glu Glu Leu Glu Lys Ala Leu Ser Cys Pro Gly Gin Pro Ser Asn Cys 

50 55 60 

Val Thr lie Pro Arg Ser Leu Asp Gly Arg Leu Gin Val Ser His Arg 
65 70 75 80 

Lys Gly Leu Pro His Val lie Tyr Cys Arg Val Trp Arg Trp Pro Asp 

85 90 95 

Leu Gin Ser His His Glu Leu Lys Pro Leu Glu Cys Cys Glu Phe Pro 

100 105 110 

Phe Gly Ser Lys Gin Lys Glu Val Cys He Asn Pro lyr His Tyr Lys 

115 120 125 

Arg Val Glu Ser Pro Val Leu Pro Pro Val Leu Val Px'o Arg His Ser 

130 135 140 

Glu Tyr Asn Pro Gin His Ser Leu Leu Ala Gin Phe Arg Asn Leu Gly 
145 150 155 160 



wo 98/56913 

PCTAJS98/12078 

-54- 

Gln Asn Glu Pro His Met Pro Leu Asn Ala Thr Phe Pro Asp Ser Phe 
165 



Gin Gin Pro Asn Ser His Pro Phe Pro His Ser Pro Asn Ser Ser Tyr 

180 185 190 

5 Pro Asn Ser Pro Gly Ser Ser Ser Ser Thr Tyr Pro His Ser Pro Thr 

195 200 205 

Ser Ser Asp Pro Gly Ser Pro Phe Gin Met Pro Ala Asp Thr Pro Pro 

210 215 220 

Pro Ala Tyr Leu Pro Pro Glu Asp Pro Met Thr Gin Asp Gly Ser Gin 

° 230 235 

'^^^ 240 

Pro Met Asp Thr Asn Met Met Ala Pro Pro Leu Pro Ser Glu lie Asn 

250 255 
Arg Gly ASP Val Gin Ala Val Ala Tyr Glu Glu Pro Lys His Tzp Cys 
260 265 270 

5 Ser He Val Tyr Tyr Glu Leu 



275 280 



Asn Asn Arg Val Gly Glu Ala Plie His 

285 

Ala ser Ser Thr Ser Val Leu Val Asp Gly Phe Thr Asp Pro Ser Asn 

295 300 
Asn Lys Asn Arg Phe Cys l^u Gly Leu Leu Ser Asn Val Asn Arg Asn 

310 

■^15 320 
ser Thr lie Glu Asn Thr Arg Arg His He Gly Lys Gly Val His Leu 

325 330 
Tyr Tyr Val Gly Gly Glu Val Tyr Ala Glu Cys Leu Ser Asp 



Ser Ser 



340 345 



350 

He Phe Val Gin Ser Arg Asn Cys Asn Tyr His Has Gly Phe His Pro 

360 365 
Thr Thr Val cys Lys He Pro Ser Gly Cys Ser Leu Lys lie 



Phe Asn 



370 



375 



380 



Asn Gin Glu Phe Ala Gin Leu Leu Ala Gin Ser Val Asn His Gly Phe 
385 

•^yb 400 
Glu Thr val lyr Glu leu Thr Lys Met Cys Thr He Arg Met Ser Phe 



405 



410 



415 
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Val Lys Gly Trp Gly Ala Glu Tyr His Arg Gin Asp Val Thr Ser Thr 

425 

Pro Cys Txp He Glu lie His Leu His Gly Pro Leu Gin Trp Leu 



Asp 



435 



440 



445 



5 Lys Val Leu Thr Gin Met Gly Ser Pro His Asn Pro He 



450 



Ser Ser Val 
460 



Ser 
465 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUEWCE CHARACTERISTICS: 

(A) LENGTH: 467 amino acids 

(B) TYPE: amino acid 

(C) STRANDECNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

20 fxi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met ser Ser He Leu Pro Phe Thr Pro Pro Val Val Lys Arg Leu Leu 

15 

Gly Trp Lys Lys Ser Ala Gly Gly Ser Gly Gly Ala Gly Gly Gly Glu 
Gin Asn Gly Gin Glu Glu Lys Trp Cys Gin Lys Ala Val 



Lys Ser Leu 



35 



40 



45 



Val Lys Lys I^u Lys Lys Thr Gly Arg I^u Asp Glu Leu Glu Lys Ala 

55 60 
30 He Thr Thr Gin Asn Cys Asn Thr Lys Cys Val Thr He Pro Ser Thr 

"70 75 

80 

cys ser Glu He Trp Gly Leu Ser Thr Ala Asn Thr Val Asp Gin Trp 
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85 90 95 

Asp Thr Thr Gly Leu Tyr Ser Phe Ser Glu Gin Thr Arg Ser Leu Asp 

100 105 110 

Gly Arg Leu Gin Val Ser His Arg Lys Gly Leu Pro fiis Val He Tyr 
^ 115 120 125 

Cys Arg I^u Trp Arg Trp Pro Asp Leu His Ser His His Glu Leu Lys 

130 135 140 

Ala He Glu Asn Cys Glu Tyr Ala Phe Asn Leu Lys Lys Asp Glu Val 

150 155 160 

10 Cys Val Asn Pro Tyr His Tyr Gin Arg Val Glu Thr Pro Val Leu Pro 

165 170 175 

Pro Val l^u Val Pro Arg His Thr Glu He Leu Thr Glu Leu Pro Pro 

180 185 190 

Leu Asp Asp Tyr Thr His Ser He Pro Glu Asn Thr Asn Phe Pro Ala 
5 195 200 205 

Gly He Glu Pro Gin Ser Asn Tyr He Pro Glu Thr Pro Pro Pro Gly 

210 215 220 

Tyr He Ser Glu Asp Gly Glu Thr Ser Asp Gin Gin Leu Asn Gin Ser 

230 235 240 

3 Met Asp Thr Gly Ser Pro Ala Glu Leu Ser Pro Thr Thr Lsu Ser Pro 

245 250 255 

Val Asn His Ser Leu Asp Leu Gin Pro Val Thr Tyr Ser Glu Pro Ala 

260 265 270 

Phe Trp Cys Ser He Ala Tyr Tyr Glu Leu Asn Gin Arg Val Gly Glu 

275 280 285 

Thr Phe His Ala Ser Gin Pro Ser Leu Thr Val Asp Gly Phe Thr Asp 

290 295 300 

Pro Ser Asn Ser Glu Arg Phe Cys Leu Gly Leu Leu Ser Asn Val Asn 

310 315 320 

Arg Asn Ala Thr Val Glu Met Thr Arg Arg His He Gly Arg Gly Val 

325 330 335 

Arg Leu Tyr Tyi- He Gly Gly Glu Val Phe Ala Glu Cys Leu Ser Asp 
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340 345 350 

Ser Ala He Phe Val Gin Ser Pro Asn Cys Asn Gin Arg Tyr Gly Trp 

355 360 365 

His Pro Ala Thr Val Cys Lys He Pro Pro Gly Cys Asn Leu Lys He 
5 370 375 380 

Phe Asn Asn Gin Glu Phe Ala Ala Leu Leu Ala Gin Ser Val Asn Gin 

390 3.95 400 

Gly Phe Glu Ala Val Tyr Gin Leu Thr Arg Met Cys I'hr He Arg Met 
405 410 
10 Ser Phe Val Lys Gly Trp Gly Ala Glu Tyr Arg Arg Gin Thr Val Tiir 
420 425 430 

Ser Thr Pro Cys Trp He Glu Leu His Leu Asn Gly Pro Leu Gin Trp 

435 440 445 

L^u Asp Lys Val Leu Thr Gin Met Gly Ser Pro Ser Val Arg Cys Ser 
'5 450 455 460 

Ser Met Ser 
465 



(2) INFORMATION FOR SEQ ID NO: 8: 



20 



(i) SEQUEasrCE CHARACTERISTICS: 

(A) LENGTH: 425 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



30 



Met Ser Ser He Leu Pro Phe Thr Pro Pro He Val Lys Arg Leu Leu 

15 10 15 
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Gly Trp Lys Lys Gly Glu Gin Asn Gly Gin Glu Glu Lys Trp Cys Glu 

20 25 30 

Lys Ala Val Lys Ser Leu Val Lys Lys Leu Lys Lys Thr Gly Gin Leu 
35 40 45 

5 Asp Glu Leu Glu Lys Ala lie Thr Thr Gin Asn Va], Asn Thr Lys Cys 
50 55 60 

He Thr He Pro Arg Ser Leu Asp Gly Arg Leu Gin Val Ser His Arg 

70 75 3^ 

Lys Gly Leu Pro His Val He Tyr Cys Arg Leu Trp Arg Trp Pro Asp 

85 90 95 

I^u His Ser His His Glu I^u Arg Ala Met Glu Leu Cys Glu Phe Ala 

100 105 
Phe Asn Met Lys Lys Asp Glu Val Cys Val Asn Pro Tyr His Tyr Gin 
115 120 125 

5 Arg Val Glu Thr Pro Val Leu Pro Pro Val Leu Val Pro Arg His Thr 
130 135 140 

Glu He Pro Ala Glu Phe Pro Pro Leu Asp Asp Tyr Ser His Ser He 

1^5 150 Ti-c- 

^ 155 160 

Pro Glu Asn Thr Asn Phe Pro Ala Gly He Glu Pro Gin Ser Asn He 
^ 165 170 175 

Pro Glu Thr Pro Pro Pro Gly lyr Leu Ser Glu Asp Gly Glu Thr Ser 

180 185 
Asp His Gin Met Asn His Ser Met Asp Ala Gly Ser Pro Asn Leu Ser 

195 200 205 

Pro Asn Pro Met Ser Pro Ala His Asn Asn Leu Asp Leu Gin Pro Val 

210 215 220 

Thr Tyr cys Glu Pro Ala Phe Trp Cys Ser He Ser Tyr Tyr Glu Leu 



225 230 

^■^^ 235 240 

Asn Gin Arg Val Gly Glu Thr Phe His Ala Ser Gin Pro Ser Met Thr 

245 250 255 

Val Asp Gly Phe Thr Asp Pro Ser Asn Ser Glu Arg Phe Cys Leu Gly 
260 265 270 
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Leu Leu Ser Asn Val Asn Arg Asn Ala Ala Val Glu Leu Thr Arg Arg 

275 280 285 

His He Gly Arg Gly Val Arg Leu Tyr Tyr He Gly Gly Glu Val Phe 

290 295 300 

Ala Glu Cys Leu Ser Asp Ser Ala He Phe Val Gin Ser Pro Asn Cys 
305 310 315 

Asn Gin Arg Tyr Gly Trp His Pro Ala Thr Val Cys Lys He Pro Pro 

325 330 335 

Gly Cys Asn Leu Lys He Phe Asn Asn Gin Glu Phe Ala Ala Leu Leu 

340 345 350 

Ala Gin Ser Val Asn Gin Gly Phe Glu Ala Val Tyr Gin Leu Thr Arg 

355 360 365 

Met Cys Thr He Arg Met Ser Phe Val Lys Gly Trp Gly Ala Glu Tyr 

370 375 380 

Arg Arg Gin Thr Val Thr Ser Thr Pra Cys Trp He Glu Leu His Leu 
385 390 395 4OO 

Asn Gly Pro Leu Gin Trp Leu Asp Lys Val Leu Thr Gin Met Gly Ser 

405 410 415 

Pro Ser He Arg Cys Ser Ser Val Ser 
420 425 

(2) INFORMATION FOR SBQ ID NO: 9: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 552 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUEtJCE DESCRIPTION: SEQ ID NO: 9: 
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Met Asp Asn Met Ser lie Thr Asn Thr Pro Thr Ser Asn Asp Ala Cys 
Leu Ser He Val His Ser Leu Met Cys His Arg Gin Gly Gly Glu Ser 



20 



25 



30 



5 Glu Thr Phe Ala Lys Arg Ala He Glu Ser Leu Val Lys Lys Leu Lys 



35 40 



45 



Glu Lys Lys Asp Glu Leu Asp Ser Leu lie Thr Ala lie Thr Thr 



50 55 



Asn 

60 

Gly Ala His Pro Ser Lys Cys Val Thr lie Gin Arg Thr Leu Asp Gly 

Arg Leu Gin Val Ala Gly Arg Lys Gly Phe Pro His Val lie lyr Ala 

85 90 95 

Arg Leu Tip Arg Trp Pro Asp I^u His Lys Asn Glu Leu Lys His Val 
100 105 no 

5 Lys Tyr Cys Gin Tyr Ala Phe Asp I^u Lys Cys Asp Ser Val Cys Val 
115 125 
Asn Pro Tyr His Tyr Glu Arg Val Val Ser Pro Gly lie Asp Leu Ser 

130 135 140 

Gly Leu Thr Leu Gin Ser Asn Ala Pro Ser Ser Met Met Val Lys Asp 
145 150 



155 



160 



Glu Tyr Val His Asp Phe Glu Gly Gin Pro Ser Leu Ser Thr Glu Gly 

165 170 175 

His Ser lie Gin rhr lie Gin His Pro Pro Ser Asn Arg Ala Ser Thr 

180 185 190 

Glu Thr Tyr Ser Thr Pro Ala Leu Leu Ala Pro Ser Glu Ser Asn Ala 

195 200 205 

Thr ser Thr Ala Asn Phe Pro Asn He Pro Val Ala Ser Thr Ser Gin 

215 220 
Pro Ala Ser He Leu Gly Gly Ser His Ser Glu Gly Leu Leu Gin Tie 
225 230 235 240 

Ala Ser Gly Pro Gin Pro Gly Gin Gin Gin Asn Gly Phe Thr Gly Gin 
245 250 255 
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Pro Ala Thr lyr His His Asn Ser Thr Thr Thr Trp Thr Gly Ser Arg 

265 270 
Thr Ala Pro Tyr Thr Pro Asn Leu Pro His His Gin Asn Gly His Leu 
275 280 285 

5 Gin His His Pro Pro Met Pro Pro His Pro Gly His Tyr Trp Pro Val 



295 



His Asn Glu Leu Ala Phe Gin Pro Pro He Ser Asn His Pro Ala Pro 
305 310 

320 

Glu Tyr Trp Cys Ser He Ala Tyr Phe Glu Met Asp Val Gin Val Gly 

330 335 
Glu Thr Phe Lys Val Pro Ser Ser Cys Pro lie Val Thr Val Asp Gly 

345 350 
Tyr Val Asp Pro Ser Gly Gly Asp Arg Phe Cys Leu Gly Gin Leu Ser 

360 365 
■5 Asn Val His Arg Thr Glu Ala He Glu Arg Ala Arg Leu His lie Gly 

375 380 
Lys Gly Val Gin Leu Glu Cys Lys Gly Glu Gly Asp Val Val Arg 

390 395 

jyb 400 

Cys Leu Ser Asp His Ala Val Phe Val Gin Ser lyr Tyz- Leu Asp Arg 

Glu Ala Gly Arg Ala Pro Gly Asp Ala Val His Lys He Tyr Pro Ser 

420 425 430 

Ala Tyr He Lys Val Phe Asp Leu Arg Gin Cys His A^ Gin Met Gin 
435 440 445 

25 Gin Gin Ala Ala Thr Ala Gin Ala Ala Ala Ala Ala Gin Ala Ala AJa 

455 

val Ala Gly Asn He Pro Gly Pro Gly Ser Val Gly Gly He Ala Pro 

470 

^ /t. 4Q0 
Ala He Ser I^u Ser Ala Ala Ala Gly He Gly Val Asp Asp Leu Arg 
" 485 490 495 

Arg Leu Cys He I^u Arg Met Ser Phe Val Lys Gly Trp Gly Pro Asp 



500 505 



510 
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^ Pro Arg Gin Ser lie Lys Glu Thr Pro Cys Trp He Glu He His 

520 

Leu His Arg Ala Leu Gin I^u Leu Asp Glu Val Leu His Thr Met Pro 



530 cor 



5 lie Ala Asp Pro Gin Pro Leu 
545 550 



Asp 



10 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENCTTH: 465 amino acicJs 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



15 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

20 Met Thr ser Met Ala Ser Leu Phe Ser Phe Thr Ser Pro Ala Val Lys 

Arg Leu Leu Gly Trp Lys Gin Gly Asp Glu Glu Glu Lys Trp Ala Glu 

Lys Ala val Asp Ala Leu Val Lys Lys I^u Lys Lys Lys Lys Gly Ala 

^5 40 45 

Met Glu Glu I^u Glu Lys Ala I^u Ser Ser Pro Gly Gin Pro Ser Lys 

cys Val Thr He Pro Arg Ser I^u Asp Gly Arg Leu Gin Val Ser Hxs 

30 Arg Lys Gly I^u Pro His Val He lyr Cys Arg Val Trp Arg Trp Pro 

85 90 ,5 

■ASP Leu Gin Ser His His Glu Leu Lys Pro Leu Asp He Cys Glu Phe 



25 
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100 



105 



110 



Pro Phe Gly Ser Lys Gin Lys Glu Val Cys He Asn Pro 'lyr His Tyr 

115 120 125 

Lys Arg Val Glu Ser Pro Val I^u Pro Pro Val Leu Val Pro Arg His 

135 

Asn Glu Phe Asn Pro Gin His Ser Leu Leu Val Gin Phe Arg Asn Leu 

Ser His Asn Glu Pro His Met Pro Gin Asn Ala Thr Phe Pro Asp Ser 
165 170 
10 Phe His Gin Pro Asn Asn Ala Pro Phe Pro Leu Ser Pro Asn Ser Pro 
180 185 190 

TVr Pro Pro Pro Pro Ala Ser Ser Thr Tyr Pro Asn Ser Pro Ala Ser 

195 200 205 

Ser Gly Pro Gly Ser Pro Phe Gin Leu Pro Ala Asp Thr Pro Pro Pro 

210 215 220 

Ala Tyr Met Pro Pro Asp Asp Gin Met Ala Pro Asp Asn Ser Gin Pro 

235 

Met Asp Thr Ser Ser Asn Met He Pro Gin Thr Met Pro Ser He Ser 

250 255 
20 ser Arg Asp Val Gin Pro Val Ala 'lyr Glu Glu Pro Ly. His Trp Cys 

265 270 
Ser He Val Tyr Tyr Glu Leu Asn Asn Arg Val Gly Glu Ala Phe His 

275 280 285 

Ala ser Ser Thr Ser Val Leu Val Asp Gly Phe Thr Asp Pro Ser Asn 

290 295 
Asn Lys Ser Arg Phe C^s L^u Gly l^u Leu Ser Asn Val Asn Arg Asn 

310 31C 

•^15 320 

ser Thr lie Glu Asn Thr Arg Arg His He Gly Lys Gly Val His Leu 
325 330 335 

30 Tyr Tyr Val Gly Gly Glu Val Tyr Ala Glu Cys Leu Ser Asp Ser Ser 
340 345 
He Phe val Gin Ser Arg Asn Cys Asn Phe His His Gly Phe His Pro 
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365 

Thr Thr Val Cys Lys He Pro Ser Ser Cys Ser Leu Lys He Phe Asn 

375 

Gin Glu Phe Ala Gin Leu Leu Ala Gin Ser 

385 



Asn Gin Glu Phe Ala Gin Leu Leu Ala Gin Ser Val Asn His Gly Phe 

Glu Ala val lyr Glu Leu Thr Lys Met Cys Thr He Arg Met Ser Phe 

410 

Val Lys Gly Trp Gly Ala Glu Tyr His Arg Gin Asp Val Thr Ser Thr 

425 

Pro cys Trp He Glu He His I^u His Gly Pro Leu Gin Leu Asp 



435 



Lys Val Leu Thr Gin Met Gly Ser Pro Leu Asn Pro 11 



e Ser Ser Val 



450 



455 



460 
Ser 

465 
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CLAIMS 

1. An isolated nucleic acid molecule selected from the group consisting of 

(a) nucleic acid molecules which hybridize under stringent conditions to a molecule 
consisting of the nucleic acid sequence of SEQ ID NO: 1 and which code for a polypeptide which 
inhibits TGF-p superfamily signalling, 

(b) nucleic acid molecules that differ from the nucleic acid molecules of (a) in codon 
sequence due to the degeneracy of the genetic code, and 

(c) complements of (a) and (b). 

2. The isolated nucleic acid molecule of claim 1, wherein the isolated nucleic acid molecule 
comprises the nucleic acid sequence of SEQ ID N0:3. 

3. The isolated nucleic acid molecule of claim 2, wherein the isolated nucleic acid molecule 
consists of the nucleic acid sequence of SEQ ID N0:3. 

4. The isolated nucleic acid molecule of claim I, wherein the isolated nucleic acid molecule 
consists of the nucleic acid sequence of SEQ ID NO: 1 . 

5. An isolated nucleic acid molecule selected from the group consisting of (a) a unique 
fragment of SEQ ID N0:3 between 20 and 1487 nucleotides in length, and (b) complements of 
"(a)", provided that the nucleic acid molecule excludes sequences consisting only of SEQ ID 
N0:4 or fragments thereof. 

6. The isolated nucleic acid molecule of claim 5, wherein the isolated nucleic acid molecule 
is a unique fragment of SEQ ID N0:3 selected from the group consisting of at least 22 
contiguous nucleoddes, at least 24 contiguous nucleotides, at least 26 contiguous nucleotides, at 
least 28 contiguous nucleotides, at least 30 contiguous nucleotides, at least 35 contiguous 
nucleotides, and at least 40 contiguous nucleotides. 

7. The isolated nucleic acid molecule of claim 5, wherein the isolated nucleic acid molecule 
consists of between 20 and 32 contiguous nucleotides. 
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8. An expression vector comprising the isolated nucleic acid molecule of claims I, 2, 3, 4, 5, 
6, or 7 operably linked to a promoter. 

9. A host cell transformed or transfected with the expression vector of claim 8. 

1 0. A transgenic non-human animal comprising the expression vector of claim 8. 

11. The transgenic non-human animal of claim 10, comprising a conditional Smad6 
expression vector. 

12. A transgenic non-human animal which has reduced expression of Smad6 nucleic acid 
molecules. 

13. The transgenic non-human animal of claim 12, comprising a Smad6 gene disrupted by 
homologous recombination. 

14. The transgenic non-human animal of claim 12, comprising a conditional Smad6 gene 
disruption, 

15. The transgenic non-human animal of claim 12, comprising a trans-acting negative 
regulator of Smad6 expression. 

1 6. An isolated polypeptide encoded by the isolated nucleic acid molecule of claim 1,2,3, or 
4, wherein the polypeptide has TGF-P superfamily signalling inhibitory activity. 

17. The isolated polypeptide of claim 16, wherein the isolated polypeptide consists of the 
amino acid sequence of SEQ ID N0:2. 

1 8. The isolated polypeptide of claim 1 6, wherein the isolated polypeptide consists of a 
functional fragment or variant. 

19. An isolated polypeptide which binds selectively a polypeptide encoded by the isolated 
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nucleic acid molecule of claim 1, 2, 3, or 4, provided that the isolated polypeptide is not a 
polypeptide selected from the group consisting of TGF-P receptor, activin receptor and BMP 
receptor. 

20. The isolated polypeptide of claim 19, wherein the isolated polypeptide binds to an 
epitope defined by a polypeptide consisting of the amino acid sequence of SEQ ID N0;2, 

21 . The isolated polypeptide of claim 19, wherein the isolated polypeptide is an antibody 
fragment selected from the group consisting of a Fab fragment^, a Ffab)^ fragment or a fragment 
including a CDR3 region selective for a Smad6 polypeptide. 

22. The isolated polypeptide of claim 19, wherein the isolated polypeptide is a monoclonal 
antibody, a humanized antibody or a chimeric antibody. 

23 . An isolated complex of polypeptides comprising: 

a TGF-P superfamily receptor or receptor complex, bound to a polypeptide as claimed in 
claim 16. 

24. The isolated complex of polypeptides of claim 23, wherein the polypeptide as claimed in 
claim 16 consists of the amino acid sequence of SEQ ID N0:2. 

25. The isolated complex of claim 23, wherein the receptor or receptor complex is selected 
from the group consisting of TPRI, BMPR-IA, BMPR-IB, AclR-IA, a complex of TpRI and 
TpRII, a complex of BMPR-IA and BMPR-II, a complex of BMPR-IB and BMPR-II, a complex 
of ActR-IA and BMPR-Il and a complex of ActlMA and ActR-11. 

26. A method for reducing TGF-p superfamily signal transduction in a mammalian cell, 
comprising 

contacting the mammalian cell with an amount of the isolated nucleic acid molecule of 
claim 1 or an expression product thereof effective to reduce TGF-P superfamily signal 
transduction in the mammalian cell. 



wo 98/56913 PCT/US98/12078 

- 68 - 

27. A method for reducing phosphorylation of a pathway restricted Smad polypeptide in a 
mammalian cell, comprising 

contacting the mammalian cell with an amount of the isolated nucleic acid molecule of 
claim 1 or an expression product thereof effective to reduce phosphorylation of the pathway 
restricted Smad polypeptide in the mammalian cell. 

28. The method of claim 27, wherein the pathway specific Smad polypeptide is selected from 
the group consisting of Smad I and Smad2. 

29. A method for modulating the proliferation of a cancer cell, the proliferation of whi^h is 
altered by TGF-p superfamily signal transduction, comprismg 

contacting the cancer cell with an amount of an isolated nucleic acid molecule of claim 1 
or an expression product thereof, effective to modulate the proliferation of the cancer cell. 

30. A method of increasing TGF-p superfamily signal transduction in a cell comprising 
contacting the cell with an amount of an agent that selectively binds to an isolated nucleic 

acid molecule of claim 1 or an isolated Smad polypeptide expression product thereof, effective to 
increase the 1 GF-p superfamily signal transduction in the cell. 

3 1 . The method of claim 30 wherein the TGF-P superfamily signal transduction is mediated 
by a TGFp superfamily ligand selected from the group consisting of TGF-P 1, activin, Vgl, 
BMP^ and BMP-7. 

32. The method of claim 31, wherein the agent is an antisense nucleic acid. 

33. The method of claim 3 1 , wherein the agent is a polypeptide. 

34. The method of claim 33, wherein the polypeptide is selected from the group consisting of 
an antibody which binds an epitope defined by a polypeptide comprising the amino acid 
sequence of SEQ ID N0:2 and a dominant negative variant of a Smad6 polypeptide. 



35. A composition comprising: 
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the polypeptide of claim 16, and 

a pharmaceutically acceptable carrier. 



36. A method for decreasing Smad6 TGF-p superfaniily inhibitory activity in a subject 
5 comprising 

administering to a subject in need of such treatment an agent that selectively binds to an 
isolated nucleic acid molecule of claim 1 or an expression product thereof, in an amount effective 
to decrease Smad6 TGF-P superfamily inliibitory activity in the subject. 

» 

10 37. The method of claim 36 wherein the TGF-P superfamily inhibitory activity is inhibition •* 
of TGF-P superfamily signal transduction mediated by a TGFp superfamily ligand selected from 
the group consisting of TGF-pi, activin, Vgl, BMP-4 and BMP-7. 

38. The method of claim 37, wherein the agent is an antisense nucleic acid. 

15 

39. The method of claim 37, wherein the agent is a polypeptide. 

40. The method of claim 39, wherein the polypeptide is selected from the group consisting of 
an antibody which binds an epitope defined by a polypeptide comprising the amino acid 

20 sequence of SEQ ID NO:2 and a dominant negative variant of a Smad6 polypeptide. 

41 . A method for treating a condition characterized by abnormal BMP activity, comprising 
administering to a subject in need of such treatment an effective amount of Smad6 or a 

Smad6 agonist or antagonist sufficient to restore the BMP activity to nomiaL 

25 

42. The method of claim 41, wherein the condition is selected from the group consisting ol' 
ossification of tlie posterior longitudinal ligament and ossification of the ligament flavum. 



30 



43. A method for treating a condition characterized by abnormal TGF-P activity, comprising 

administering to a subject in need of such treatment an effective amount of Smad6 or a 
Smad6 agonist or antagonist sufficient to restore the TGF-P activity to normal. 
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44. The method of claim 43, wherein the condition is a fibrotic condition selected from the 
group consisting of liver fibrosis, kidney Fibrosis, lung fibrosis, skin fibrosis, arterial fibrosis, 
central nervous system fibrosis, and other fibrotic diseases. 

45. The method of dim 44, wherein the fibrotic condition is selected fi-om the group 
consisting of cirrhosis, veno-occlusive disease, glomerulonephritis, diabetic nephropathy, 
allograft rejection, HIV nephropathy, idiopathic fibrosis, autoimmune fibrosis, systemic 
sclerosis, keloids, hypertrophic bum scars, eosinophilia-myalgia syndrome, vascular restenosis, 
atherosclerosis, intraocular fibrosis, rheumatoid arthritis and nasal polyposis. 

46. A method for modulating the expression of cyclin A, comprising 

contacting a cell with Smad6 or an agonist or antagonist thereof, in an amount effective 
to modulate the expression of cyclin A. 

47. The method of claim 46, wherein the cell is contacted with Smad6 or an agonist thereof, 
and wherein the expression of cyclin A is increased 

48. The method of claim 46, wherein the cell is contacted witli an antagonist of Smad6, and 
wherein the expression of cyclin A is decreased. 

49. The method of claim 48, wherein the antagonist of Smad6 is selected from the group 
consisting of antibodies to Sraad6, dominant negative variants of Smad6 and Smad6 antisense 
nucleic acids. 

50. A method for identifying lead compounds for a phannacological agent useful in the 
diagnosis or treatment of disease associated with Smad6 TGF-P superfamily inhibitory activity, 
comprising 

forming a mixture comprising a Smad6 polypeptide, a receptor or receptor complex for a 
TGF-P superfamily member, and a candidate pharmacological agent, 

incubating the mixture under conditions which, in the absence of the candidate 
pharmacological agent, permit a first amount of specific binding of the TGF-p superfamily 
receptor or receptor complex by the Smad6 polypeptide, and 
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detecting a lest amount of the specific binding of the TGF-p superfamily receptor or 
receptor complex by the Smad6 polypeptide, wherein reduction of the test amount of specific 
binding relative to the first amount of specific binding indicates that the candidate 
pharmacological agent is a lead compound for a pharmacological agent which disrupts the 
Smad6 TGF-P superfamily signal transduction inhibitory activity, and wherein increase of the 
test amount of specific binding relative to the first amount of specific binding indicates that the 
candidate pharmacological agent is a lead compound for a pharmacological agent which 
enhances the Smad6 TGF-P superfamily signal transduction inhibitory activity. 

5 1 . The method of claim 50, wherein the member of the TGF-p superfamily is selected from . 
the group consisting of TGF-p 1, activin, BMP-4 and BMP-7. 

52. A method for identifying lead compounds for a pharmacological agent useful in the 
diagnosis or treatment of disease associated with Smad6 TGF-P superfamily signal transduction 
inhibitory activity, comprising 

forming a mixture comprising a Smad6 polypeptide, a receptor or receptor complex for a 
TGF-P superfamily member, a pathway restricted Smad polypeptide and a candidate 
pharmacological agent, 

incubating the mixture under conditions which, in the absence of the candidate 
pharmacological agent, permit a first amount of TGF-P superfamily member induced 
phosphorylation of the pathway restricted Smad polypeptide, and 

detecting a test amount of TGF-P superfamily member induced phosphorylation of the 
pathway restricted Smad polypeptide, wherein reduction of the test amount of the 
phosphorylation of the pathway restricted Smad polypeptide relative to the first amount of the 
phosphorylation of the pathway restricted Smad polypeptide indicates that the candidate 
pharmacological agent is a lead compound for a pharmacological agent which enhances the 
TGF-P superfamily signal transduction inhibitory activity of Smad6 and wherein increase of the 
test amount of the phosphorylation of the pathway restricted Smad polypeptide relative to the 
first amount of the phosphorylation of the pathway restricted Smad polypeptide indicates that the 
candidate pharmacological agent is a lead compound for a pharmacological agent which disrupts 
the TGF-p superfamily signal transduction inhibitory activity of Smad6. 
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53. The method of claim 52, wherein the pathway restricted Smad polypeptide is selected 
from the group consisting of Smadl and Smad2. 

54. The method of claim 52, wherein the member of the TGF-P superfamily is selected from 
5 the group consisting of TGF-p I , acti vin, BMP-4 and BMP-7. 

55. A method for increasing phosphorylation of Smad3 in a mammalian cell, comprising 

contacting the mammalian cell with an amount of the isolated nucleic acid molecule of 

claim 1 or an expression product thereof effective to increase phosphorylation of Smad3 in tlie 

.« 

10 mammalian cell. 

56. A method for reducing heteromerization of Smad2 with Smad3 or Smad4 in a 
mammalian cell, comprising 

contacting the mammalian cell with an amount of the isolated nucleic acid molecule of 
15 claim 1 or an expression product thereof effective to reduce heteromerization of Smad2 with 
Smad3 or Smad4 in the mammalian cell. 
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